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ABSTRACT 


The  relative  effectiveness  on  decision  makers' 

performance  when  Integrating  color  functionally  into  the 
mixed  graphics  of  a  Decision  Support  System  (DSS)  was 
Investigated.  The  method  of  research  was  a  laboratory 
approach  using  a  man-machine  experiment.  The  key  factors 
of  the  experimental  design  were  determined  by  the 

components  of  Mason  and  Mltroff's  (1973)  definition  of  a 
DSS.  The  decision  makers  had  academic  engineering 
backgrounds.  The  performance  measures  Included  3  levels 
of  decision  time  and  12  levels  of  decision  quality.  Seven 
characteristics  of  decision  makers  were  included:  age, 
sex,  college  level  and  experience,  attitude  toward  the 
computer,  and  experience  with  computers  and  video  games. 
Five  alternatives  of  colors  were  functionally  Integrated 
into  mixed  graphics  (text  and  graphics)  displays:  four 
chromatic  and  one  monochrome.  The  decisions  represented  a 
class  of  problems  which  were  structured  and  had 

deterministic  data.  The  interface  employed  was  computer 
generated  hardcopy.  The  organizational  context  was 

managerial  control  of  resources.^^ 

One  hundred  eleven  hypotheses  were  formulated.  The 
data  collected  on  the  performance  of  120  subjects  who 


worked  an  hour  long  engineering  inventory  case  study  were 
rigorously  analyzed  to  gain  Insight  on  the  effects  of 
color. 

Of  the  86  models  which  measured  decision  quality,  78 
contained  insufficient  evidence  to  conclude  chromatic 
graphics  displays  differentially  affected  performance. 
The  typically  used  CRT  colors  correlated  with  the  best 
performance  in  the  majority  of  the  models  which  supported 
differential  performance.  This  CRT  alternative  averaged  a 
9.2%  nonsign  if leant  increase  in  decision  quality  over  the 
second  place  monochrome  alternative.  Only  1  of  the  25 
models  which  measured  time  contained  sufficient  evidence 
to  conclude  color  treatment  differentially  affected 

performance.  A  chromatic  alternative  ranked  first, 
supporting  a  nonsignificant  2.7%  Increase  in  performance 
over  the  second  ranked  monochrome  alternative. 

These  results,  though  inconsistent  with  the 
theoretical  and  subjective  reports,  are  concurrent  with 
other  related  empirical  research.  Color  does  not  have  a 
unique  quality  for  functionally  coding  Information  in  a 
OSS.  If  color  is  incorporated  into  a  DSS,  those  colors 
currently  preset  on  many  CRTs  appear  best.  However 
increased  efficiency  in  a  decision  maker's  performance 
attributable  to  functional  color  can  not  be  assumed. 
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I.  THE  PROBLEM  AND  ITS  BACKGROUND 


Introduction 

Computer  generated  graphics  supporting  the  option  of 

color  have  become  the  heart  of  many  decision  support 

systems  (DSS)  in  the  industrial,  business  and  military 

sectors.  Technological  advances  in  the  microelectronic 

industry,  an  increasing  reliance  on  computers  by  decision 

makers,  and  the  affordability,  availability  and  usability 

of  the  hardware,  software  and  servicing  have  made  these 

systems  proliferate  steadily.  However  these  systems 

have  been  growing  seemingly  in  an  ad  hoc  manner. 

While  support  and  praise  of  these  systems  has 
been  relatively  high,  ft  appears  that  efforts 
have  fallen  short  of  rea 1 1 z 1 ng  max 1 mum 
effectiveness  and  efficiency  for  the  system 
designed  (Dickson,  Senn  and  Chervany,  1977,  p. 

913) . 

The  technical  development  of  the  color  graphical  displays 
Is  ahead  of  reported,  quantitative  information  regarding 
its  contribution  to  the  DSS.  The  challenge  remains  to 
ascertain  Its  relative  effectiveness  on  user  performance 
(Christ  and  Corso,  1983;  Durrett,  1979;  Haber  and 
Wilkinson,  1982;  Hanson,  1979;  Leith  and  O'Shea,  1980; 
Riesenfeld,  1978;  Tull  is,  1981;  Wallace,  1981). 

Background 

Color  coding  has  been  widely  used  as  a  method  of 
information  input  code  for  visual  displays  for  over  25 
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years.  However,  empirical  evidence  and  guidelines  on  how 
to  use  It  effectively  are  sparse  in  the  literature  with 
the  reports  very  narrow  in  scope.  In  fact  the  literature 
has  Just  begun  to  address  how  color  Impacts  DSS. 

After  an  extensive  review  and  analysis  of  the 
literature  on  color  in  visual  displays,  the  literature  was 
categorized  under  one  of  three  headings:  theoretical 
support,  subjective  reasoning  and  objective  experimental 
findings.  The  theoretical  and  subjective  literature  are 
concurrent  in  their  reports:  color  allows  for  relatively 
more  efficient  processing  of  Information,  thus  improves 
performance  over  use  of  a  monochrome  display  (DeMars, 
1975;  Friend,  1980?  Miller,  1982;  Morris,  1979; 
T  r uckenbroad ,  1981). 

The  majority  of  the  empirical  studies  based  their 
conclusions  on  analysis  of  performance  of  the  simple 
discrete  tasks  of  search,  locate.  Identify  and  count  type 
tasks.  Memory  was  Included  in  only  three  studies.  The 
environment  was  an  Isolated  laboratory  setting  that  often 
simulated  an  aircraft  cockpit.  The  alternative  codes  to 
which  color  was  compared  for  relative  effectiveness 
Included  letters,  numerals,  geometric  and  abstract  shapes. 
The  subjects,  whose  performance  constituted  the  data  for 
analysis,  were  usually  lacking  in  experience  with  the 


given  task. 


Without  exception  the  measure  of  performance 


was  speed  and/or  accuracy. 

Given  these  conditions,  the  findings  in  the 
experimental  literature  are  not  consistent  with  that  from 
the  theoretical  and  subjective  reports.  The  objective 


results  can 

be 

summar i zed 

by 

stat i ng 

the 

relat i ve 

effectiveness 

of 

color  is 

a 

function 

of 

the  task 

undertaken,  the  other  display  conditions  and  the  measure 
of  performance.  Generally,  color  was  not  found  to  have  a 
unique  quality  for  coding  information  in  a  display  over 
other  codes  (Barker  and  Krebs,  1977;  Christ,  1977; 
Chrfstner  and  Ray,  1961;  Cook,  1974;  Tull  is,  1981). 

Gaos  in  Knowledge 

There  continues  to  remain  glaring  gaps  in  the 
literature  on  the  use  of  color  In  visual  displays, 
especially  those  which  do  not  Involve  search,  count, 
locate  and  Identify  types  of  tasks.  With  the  computer 
graphics  penetrating  almost  every  sector  of  the 
technological  world,  visual  displays  are  being  used 
everywhere  for  a  variety  of  activities  which  involve 
information  processing.  One  of  the  fastest  growing 
sectors  of  users  are  managers  and  decision  makers  who  are 
allocating  their  resources  on  incorporating  color  graphics 
into  their  OSS  (Berk,  Brownston  and  Kaufman,  1982;  McLamb, 


1982).  Will  color  on  their  visual  displays  aid  managers 
in  their  primary  function  of  making  decisions  at  different 
levels?  The  subjective  literature  purports  ft  will.  It 
advocates  color  will  speed  up  the  decision  making  process 
significantly  as  well  as  allow  for  better  management 
recall  or  memory,  thus  better  decisions  (Durrett,  Zwiener 
and  Freund,  1981;  Friend,  1980;  Miller,  1982;  Morris, 
1979).  For  most  decision  makers  time  and  accuracy  are  two 
of  their  most  coveted  resources  (Kelley,  1979;  McLamb, 
1982;  Miastkowski  and  Wrege,  1981;  Takeuchi  and  Schmidt, 
1980;  Watson  and  Driver,  1983;  Zmud,  1979). 

However,  this  question  of  whether  color  does  allow 
for  quicker  and/or  better  decisions  has  not  been 
rigorously  addressed,  thus  remains  unanswered.  Only  two 
studies  were  found  that  involved  color  and  graphics  in  a 
information  system  (IS)  environment.  Dooley  and  Harkins 
(1970)  investigated  the  effect  color  had  on  attention  and 
memory,  when  Incorporated  into  a  graphics  display  that  sat 
in  the  corner  of  the  experiment  room.  The  subjects  did 
not  realize  the  graphics  display  was  part  of  the 
experiment  until  it  was  removed  from  the  room  and  the  post 
test  administered  to  collect  memory  data.  The  data  from 
this  study  contained  Insufficient  evidence  to  conclude 
color  affected  memory  performance.  However,  it  did  affect 


attention  time. 


Tullis  (1981)  had  his  subjects  perform  a  training  and 
diagnostic  task  involving  the  status  of  telephone  lines. 
The  graphics  were  line  schematics  displayed  on  a  CRT 
Interface.  Conditions  of  the  telephone  lines  were  coded 
by  either  an  achromatic  shape  or  chromatic  underlining  bar 
(red,  yellow,  green).  The  task  involved  decision  making 
on  a  low  level  and  no  memory.  Based  on  his  analysis  of 
the  data,  he  concluded  that  there  was  no  significant 
differences  in  time  or  accuracy  performance  between  the 
two  graphics  conditions  of  the  color  underlining  bar  and 
the  achromatic  shapes. 

Problem  Statement 

Color  graphics  technology  is  here  and  Is  being  used 
extens i ve l y  in  OSS .  However  there  still  rema 1 ns  much  to 
be  learned  regarding  when  and  how  It  can  be  used  to 
increase  a  decision  maker's  performance.  To  realize  what 
benefits  are  gained  by  I ncorporat I ng  color  into  the 
graphics  displays  of  a  DSS,  quantitative  information  Is 
needed  on  the  relative  effectiveness  color  has  on  the 
performance  of  the  decision  maker.  The  research  reported 
here  was  designed  in  attempts  to  narrow  this  void  of 
quantitative  Information  on  the  relative  impact  chromatic 
graphics  displays  have  on  the  performance  of  the  decision 


i 
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maker . 

Specifically,  the  objective  of  the  reported  research 
was  to  gain  insight  on  how  the  attribute  of  color,  when 
integrated  functionally  into  a  DSS,  affects  the 
performance  of  a  decision  maker.  The  DSS  under 
investigation  involved  decision  makers  with  academic 
engineering  backgrounds.  The  decisions  represented  a 
general  class  of  problems  involving  managerial  control  of 
resources.  The  mode  of  presentation  was  mixed  graphics 
which  were  displayed  on  computer  generated  hardcopy. 

Research  Approach  and  Dissertation  Outline 
This  chapter  concludes  with  the  highlights  of  each  of 
the  chapters  which  follow.  The  research  approach  is 
developed  throughout  this  outline  of  the  dissertation. 
Literature  Search  (Chapter  II) 

Initially  the  literature  was  searched  to  determine 
what  had  been  reported  in  the  area  of  the  impact  color 
displays  had  on  human  performance.  This  was  accomplished 
without  preference  to  type  of  performance  measured,  visual 
display  interface  used  or  chromatic  and  achromatic  codes 
compared.  Studies  based  on  theoretical,  subjective  and 
objective  analyses  were  all  included. 

The  Management  Information  System  (MIS)  literature 
was  also  searched  for  two  reasons.  First,  it  was  examined 


7 


to  glean  what  had  been  reported  on  the  topic  of  color 
and/or  graphics  displays  and  OSS.  Second  it  was 
scrutinized  to  discover  a  taxonomy  that  had  been  validated 
and  recommended  for  use  in  DSS  research. 

Based  on  the  void  discovered  in  the  literature  in  the 
quantitative  knowledge  of  the  effects  the  technology  of 
color  has  on  the  performance  of  a  decision  maker  with  an 
engineering  background,  the  problem  statement  was 
formulated.  Chapter  II,  Literature  Search,  delineates  and 
summarizes  the  studies  reviewed,  defines  the  DSS  and 
addresses  the  areas  where  more  research  is  needed. 
Methodology  (Chapter  III) 

The  MIS  literature  recommends  research  be  developed 
through  many  small  studies  (Lucas,  1981)  and  in  particular 
done  In  a  laboratory  environment  which  allows  for  controls 
(Dickson,  Senn  and  Chervany,  1977).  Since  the  objectives 
of  this  research  could  be  met  within  a  framework  of  a 
1 aboratory  approach ,  an  exper 1 ment  was  des t gned  using  this 
approach . 

The  key  parts  of  a  DSS  given  in  Mason  and  Mitroff's 
(1973)  definition  are  a  person  of  a  certain  psychological 
type,  a  class  of  problems,  data  of  a  certain  form,  a 
particular  interface  and  an  organizational  setting.  The 
DSS  Investigated  In  this  study  was  composed  of  decision 


makers  with  academic  engineering  backgrounds.  They  solved 
a  basic  engineering  case  study  which  involved  structured 
problems  and  deterministic  data.  The  data  were  presented 
in  mixed  graphics  displays  using  a  layout  widely  employed 
among  decision  makers.  The  interface  was  computer 
generated  hardcopy.  By  Anthony's  criterion  (liitroff  and 
Mason,  1977),  the  organizational  setting  was  one  of 
managerial  control  of  resources.  These  components  were 
the  key  factors  behind  the  design  of  the  experiment. 

Chapter  Ill,  Methodology,  describes  the  experiment  in 
detail.  It  discusses  the  sampled  population  of  Arizona 
State  University  (ASU)  students  and  their  character i st ic 
factors  of  age,  sex,  college  level  and  experience, 
attitude  toward  the  computer,  experience  with  computers 
and  video  games  which  were  of  interest  to  this  research. 
It  discusses  and  Justifies  the  use  of  the  selected 
Inventory  case  study  employed  as  the  Instrument  for  data 
collection.  The  mixed  graphics  display  or  mode  of 
presentation  is  then  described  and  presented  in  Figure 
3.1,  with  the  four  color  alternatives  and  one  monochrome 
intensity  pattern  defined.  The  reasons  for  choosing  the 
hardcopy  interface  are  enumerated.  The  organizational 
context  of  managerial  control  is  considered. 

This  chapter  also  Identifies  the  performance  measures 


of  time  and  quality  of  decision,  the  two  most  coveted 
resources  of  decision  makers.  These  two  are  subdivided 
Into  15  categories,  3  which  are  a  function  of  time  and  12 
wh 1 ch  are  a  f unct 1 on  of  qua  1 1 ty  of  dec  1 s 1  on .  Each  type  1 s 
operationally  defined  as  it  pertains  to  this  research. 

Pretesting  of  the  instrument  and  the  experimental 
design  are  presented.  The  procedures  and  results  are 
given.  The  Methodology  chapter  concludes  with  the 
experimental  procedures  whose  objective  was  to  collect 
data,  which,  when  analyzed  properly  would  lend  insight 
into  how  the  attribute  of  color  affects  the  performance  of 
a  decision  maker. 

Analysis  and  Results  (Chapter  IV) 

Analysis  was  done  in  three  areas:  the  problem 
statement ,  the  data  and  the  stat 1 st 1 cs  wh 1 ch  were 
generated  from  the  data.  Chapter  IV,  Analysis  and  Results 
presents  all  three  In  detail. 

Initially  the  problem  statement  was  analyzed.  Using 
15  measures  of  performance  and  7  characteristic  factors  of 
the  sampled  population,  the  problem  statement  was 
subdivided  and  formalized  into  testable  hypotheses  (Table 
4.1,  Research  Formal  Hypotheses)  which  allowed  for 
rigorous  and  Indepth  analyses. 


The  data  was  systematical ly  analyzed,  using  the 
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schematic  shown  In  Figure  4.1,  Analysis  Strategy.  This 
was  done  to  insure  the  2-factor  fixed  effects  ANOVA  model 
used  for  analyses  of  the  research  question  was  not  only 
appropriate  based  on  the  hypothesis  but  also  on  the  data. 
Special  attention  Is  given  to  this  analysis  as  rigorous 
analytic  procedure  is  one  major  shortcoming  noted  in  the 
1 iterature. 

The  Analysis  and  Results  chapter  concludes  with  the 
results  and  conclusions  derived  from  analyzing  the 
statistics  of  the  appropriate  mathematical  model.  Of  the 
111  hypotheses  that  were  tested,  the  data  underlying  the 
majority  of  them  (102)  contained  Insufficient  evidence  to 
conclude  that  any  of  the  five  treatments  (4  color,  1 
black/white)  had  a  significant  differential  effect  on  the 
performance  of  a  decision  maker.  Even  after  the  effects 
of  the  decision  maker's  characteristics  such  as  age,  sex, 
college  level  and  experience,  attitude  toward  the 
computer,  experience  with  computers  and  video  games  were 
taken  Into  account  In  the  variation  of  the  mean 
performance,  the  color  attribute  did  not  test 
statistically  significant  at  the  .05  significance  level. 

For  the  nine  hypotheses  whose  data  contained  evidence 
that  supported  the  contention  that  color  In  a  hardcopy 
mixed  graphics  display  did  affect  the  performance  of 


decision  makers.  in-depth  analyses  were  accomplished. 
Results  of  these  studies  are  displayed  in  Table  4.15, 
Ranking  of  Treatments.  The  trends  among  these  results 
showed  that  the  top  ranking  treatment  was  consistently 
chromatic,  though  not  always  the  same  particular  set  of 
colors.  However,  the  colors  presently  preset  on  a 

majority  of  CRTs  were  consistently  above  average  relative 
to  the  users'  decision  quality  performance.  However,  the 
monochrome  Intensity  pattern  always  rated  second,  and 
never  significantly  different  (<<=.05)  from  the  chromatic 
treatment  ranking  first.  The  seven  cases  where  the  CRT 
colors  out scored  the  monochrome,  the  average  increase  in 
performance  was  approximately  9.2  per  cent.  The  two  cases 
where  the  monochrome  out scored  the  CRT  color,  the  relative 
decrease  in  score  using  the  color  was  about  the  same,  7.0 
per  cent. 

Conclusions  and  Recommendations  (Chapter  V) 

Conclusions,  summaries  and  recommendations  are 
detailed  in  Chapter  V.  Conclusions  and  summaries 
categorized  Initially  by  the  particular  performance 
variable,  then  by  performance  variable  type  (decision 
quality  and  time)  and  finally  with  respect  to  the  general 
problem  statement  are  offered.  This  final  conclusion. 


which  Is  the  overall  contribution  of  this  research,  is 


that  the  attribute  of  color,  when  Integrated  functionally 
in  computer  generated  mixed  graphics  displays  does  not 
consistently  affect  the  performance  of  a  decision  maker 
who  has  an  engineering  background  who  solves  structured 
problems  based  on  deterministic  data  presented  within  a 
managerial  role  of  control  of  resources.  When  performance 
was  affected  in  the  positive  direction,  the  Increased 
effectiveness  relative  to  monochrome  was  always 
statistically  insignificant  with  the  practical  difference 
minimal.  If  color  is  integrated  into  the  mixed  graphics 
display,  the  typically  used  CRT  alternative  appears  to  be 
the  best,  however  a  decision  maker  should  not  assume  It 
will  significantly  Increase  performance. 

This  chapter  and  research  concludes  with 
recommendations  for  further  research.  These 
recommendations  are  based  on  the  recognized  limitations 
within  this  research,  and  the  suggestions  and  voids  in  the 
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I  I.  LITERATURE  SEARCH 


Introduction 

When  organizational  resources  are  expended  for 
incorporating  the  attribute  of  color  into  an  existing  or 
developing  graphics  decision  support  system  (OSS),  what 
benefits  does  the  organization  hope  to  gain  in  return? 
The  literature  has  Just  begun  to  address  this  question. 
However  the  research  on  the  effects  of  using  color  in 
visual  displays  in  other  tasks  has  been  going  on  for  years 
by  noted  Human  Factor  specialists  and  others.  These 
reports  and  findings  can  be  categorized  under  one  of  the 
three  headings:  theoretical  support,  subjective 

reasoning,  and  objective  and  experimental  findings.  This 
literature  review  is  developed  according  to  those 
categories.  The  findings  are  then  summarized  and 
compared,  with  voids  in  the  present  knowledge  highlighted. 
Emphasis  is  then  given  to  the  primary  gap  that  exists  in 
the  knowledge  available  for  managers  who  desire  to 
evaluate  the  benefits  of  color  visual  displays  to  their 
organ  1 zat 1 ona 1  effect  1 venes  s . 

Literature  Findings 

Introduction 

The  literature  search  utilized  the  Arizona  State 


University  computerized  literature  search  service.  The 
data  bases  queried  included  INSPEC,  NTIS,  COMPENDEX,  ERIC, 


Dissertation  Abstracts  and  Microcomputer  Index.  Abstracts 
from  the  queries  were  reviewed  to  determine  the  source 
documents  appropriateness.  Potentially  useful  papers  were 
located  and  read.  Relevant  references  from  these  reports 
were  also  examined.  Where  possible  the  source  document 
for  each  study  found  particularly  key  to  this  research  was 
found  and  critically  reviewed.  In  addition  personal 
communications  were  established  between  the  researcher  and 
several  noted  authors  in  the  Human  Factors  field  of  color 
research:  Richard  Christ,  Thomas  Tull  is  and  H.  Rudy 

Ramsey.  Their  personal  critically  annotated 

bibliographies  were  provided  to  complete  the  literature 
search. 

These  efforts  yielded  findings  in  the  relevant  topics 
of  the  use  of  color  in  information  systems  and  the  need 
for  research  in  the  area  of  DSS  that  incorporate  the 
option  of  color.  What  follows  is  the  review  of  the 
relevant  literature  on  the  use  of  color  as  an  Information 
code  in  visual  displays.  It  is  developed  by  sections: 
theoretical  support,  subjective  reasoning  and  experimental 
f 1 nd 1 ngs . 


The  integration  of  color  into  the  visual 
imaging  process  expands  our  capability  of 
visualizing  complex  mental  constructs  and 
commun f cat ( ng  sophisticated  ideas  (Truckenbroad, 

1981 «  p.  86). 

During  the  processing  of  information  that  is  linearly 
sequential,  read  left  to  right  and  from  top  to  bottom,  the 
left  hemisphere  of  the  brain  is  activated.  When  the 
attribute  of  color  is  added,  the  right  cerebral  hemisphere 
i s  evoked  to  process  the  i nformat i on  content 
simultaneously  with  the  left  side.  Based  on  this 
biological  concept,  many  predict  information  presented  in 
color  will  be  processed  much  more  efficiently  than  that 
which  is  not  (DeMars,  1975;  Miller,  1982;  Truckenbroad, 
1981).  What  col  or (s)  are  best,  how  much  more  efficient 
the  process  is  and  what  other  variables  impact  this 
processing  is  Just  beginning  to  be  explored  in  information 
systems . 

Subject! ve  Reason i ng 

The  decision  to  use  color  is  often  based  on 
subjective  feelings  rather  than  scientific  findings 
(Karner,  1975).  Until  very  recently  it  was  relatively 


difficult  to  investigate  the  variable  of  color  on  computer 
based  systems  simply  because  of  the  cost  associated  with 
the  system  (Durrett,  1979).  However  people  have  grown  up 


with  the  color  media  and  have  come  to  expect  this 


attribute  of  color  in  other  technology  Including  their 
graphic  displays.  Consumers  are  justifying  their  use  of 
color  systems  based  on  their  and  others'  subjective 
feel i ngs. 

DATAMEDIA,  which  introduced  its  colorscan  color  CRT 
in  1981,  feels  it  is  the  brilliancy  and  clarity  of  the 
color  that  sells  the  product  (Miller,  1982).  Visual 
packaging  is  critical  to  merchandising  and  color  is  an 
integral  part  of  any  high  quality  graphics  package 

(Catalano,  1982;  Judd  and  Wyszecki,  1975,  Needle,  1982). 
People  are  buying  and  using  color  graphics  packages  as 
part  of  their  OSS. 

When  asked  why  color  is  preferred,  several  common 
responses  occur  across  different  populations  of  users. 
Some  feel  color  adds  another  dimension  to  the  graphics 
output  (Kirby,  1979).  Others  contend  it  significantly 
increases  productivity  though  they  have  not  scient if ical 1y 
determined  if  In  fact  color  has  this  effect  (Friend,  1980; 
Miller  1982).  Another  argument  for  color  is  that  it  is 
easier  on  the  eyes  (Christ,  1977;  Miller,  1982).  Many 
Justify  the  use  of  color  through  arguments  of  prestige  and 
motivation  but  have  not  yet  rigorously  analyzed  this 
1 atter  hypothes is  (Belie  and  Rapagnan i , 
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1981;  Durrett 


1979;  Eddy,  1979;  Waller,  LeFrere  and  McDona 1 d-Ross , 
1982).  Eddy  based  this  comment  on  a  study  he  did  In  which 
24  students  used  color  display  terminals  In  a  biology 
laboratory  class.  No  control  group  or  monochrome 
terminals  were  used.  Durrett  (1979)  felt  many  students 
who  previously  were  not  interested  in  computers  were 
attracted  to  them  based  solely  on  the  color  display 
system's  Intrinsic  novelty. 

Some  purport  color  adds  more  information  per 
character  on  the  display,  therefore  making  color  displays 
more  cost  efficient  than  their  monochrome  counterparts 
(Miller,  1982;  Morris,  1979).  Skel I fngs,  who  has  a  patent 
pending  on  electronic  displayed  color  poetry  and  is  widely 
published  in  electronic  communication  periodicals,  feels 
he  gets  "something  like  45  percent  more  information" 
(Miller,  1982,  p.  87)  on  a  color  display.  He  did  not 
attempt  to  verify  his  statement.  Another  claim  presented 
without  research  backing  yet  Is  commonly  accepted  is  that 
information  presented  in  color  Is  recalled  better  (Durrett 
et  al.,  1981).  Another  reason  given  is  that  color  is  more 
natural  thus  making  graphs  easier  and  more  natural  for  the 
eyes  to  follow  (Eddy,  1979).  The  most  prevalent  response 
to  the  question  is  that  color  is  highly  appreciated  and 
aesthetically  pleasing  to  the  user  (Bergman,  1980; 


Carlson,  Parent  and  Csurf,  1979;  Catalano,  1982;  Durrett 
et  al.,  1981;  Eddy,  1979,  TulHs,  1981). 

In  several  experimental  studies,  subjective  opinions 
were  solicited  which  were  not  always  backed  by  the  data 
findings.  In  one  study  pilots  were  exposed  to  two  types 
of  visual  display  systems  (black/white  and  color)  during 
landing  approach  exercises.  They  favored  the  color 
displays.  They  felt  color  resulted  in  greater  pilot 
relaxation  and  decreased  fatigue.  They  also  felt  color 
gave  the  display  a  higher  picture  quality  and  allowed  for 
greater  real  depth  perception  (Chase,  1970).  Subjective 
evaluations  were  the  emphasis  of  this  study. 

Another  study  which  solicited  subjective  responses 
involved  subjects  who  took  part  in  a  simulated  military 
tracking  experiment.  They  reported  feeling  more  secure 
when  using  a  color  coded  display.  They  commented  that 


"color 

he 1 ps . . . ( you ) 

can  see 

the 

overal 1  pattern 

better . 

..color  makes 

It  (task) 

eas 1 er" 

(Munns,  1968,  p. 

1221)  . 

The  objective 

findings 

were  not 

concurrent  with 

the  subjective  comments. 

Color  Is  becoming  a  widely  demanded  attribute  of  many 
information  display  systems.  Unfortunately  the  decision 
and  Justification  to  incorporate  this  attribute  is  more 
often  than  not  based  on  the  subjective  feeling  that  "it 
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couldn't  hurt"  (Karner,  1975,  p.  227)  and  "color  graphics 
are  so  obviously  much  better  than  black  and  white  and 
they're  not  really  that  much  more  expensive  anymore" 
(Miller,  1982,  p.  92). 

E  xper 1 menta 1  Findings 

Investigating  the  effects  of  color  has  fascinated 
researchers  for  over  25  years  and  continues  to  do  so  with 
the  advent  of  the  computer  technology  that  now  supports 
affordable  color  and  the  Increasing  reliance  on  these 
computers  and  their  outputs.  A  majority  of  the  historical 
studies  deal  with  the  Impact  color  has  on  time  and  error 
rate  when  the  human  subject  Is  Involved  in  a  simple  task. 
These  tasks  Included  locating,  searching,  identifying, 
comparing,  counting,  tracking  and  recall  (Barker  and 

Krebs,  1977;  OeMars,  1975;  Keister,  1981).  For  all  of  the 
studies  color  was  one  of  the  two  or  more  codes  under 
investigation.  Code  Is  defined  as  "any  system  of  symbols 
for  meaningful  communication"  (Cook,  1974,  p.  3).  Rear 
projection  slides,  vlewgraphs,  hardcopy  and/or  CRT 

terminals  were  the  common  display  devices  used,  with  the 
first  two  being  the  most  prevalent.  A  display  device  is 
the  "principle  Interface  between  the  observer  and  the 
computer  or  other  device  generating  the  Information" 
(OeMars,  1975,  p.  2-1).  All  studies  reviewed  were  done 


under  laboratory  conditions.  Subjects  for  the  most  part 
were  not  experienced  in  the  tasks  they  undertook. 

The  results  of  the  historical  experiments  can  be 
summarized  by  stating  the  relative  effectiveness  of  color 
is  a  function  of  the  task  undertaken  (Baker  and  Krebs, 
1977;  Christ,  1977;  Christner  and  Ray,  1961;  Cook,  1974; 
Tullis,  1981). 

The  studies  will  be  discussed  In  chronological  ordar 
beginning  with  those  done  in  the  early  1950s.  Where 
available  the  purpose  of  the  research,  the  key  variables, 
the  specific  task,  the  sample  subjects  and  the  conclusions 
are  delineated.  A  summary  by  researcher  (Table  2.1)  and 
by  task  (Table  2.2)  conclude  this  section  complimented  by 
a  comprehensive  analysis  by  Christ,  Corso  and  Teichner  of 
the  studies  to  date. 

Green,  McGill  and  Jenkins,  1953  were  interested  in 
variables  that  affected  response  time  in  a  simple  search 
task.  Initially  they  determined  the  most  important 
variable  is  the  density  of  the  objects  to  be  scanned. 
Using  color  code  of  two  categories  (blue  and  yellow),  they 
found  search  time  was  significantly  decreased  when  color 
was  used  as  a  partially  redundant  code  (Barker  and  Krebs, 
1977;  Cook,  1974;  Jones,  1962). 

Green  and  Anderson,  1956,  used  color  as  a  partially 
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redundant  code  In  a  search  task.  Search  time  was 
significantly  decreased  under  the  condition  that  the 
observer  knew  the  color  code  prior  to  engaging  In  the  task 
(Barker  and  Krebs,  1977;  Cook,  1974). 

Chrlstner  and  Ray,  1961,  wanted  to  determine  the 
relative  effectiveness  on  performance  of  selected  target 
background  coding  combinations  when  used  In  locating, 
counting  and  Identifying  tasks.  The  coding  variables  were 
numerals,  shapes  and  colors.  They  concluded  that  the 
choice  of  target  codes  was  task  dependent.  Color  was 
superior,  though  not  sign  If icantly,  when  used  in  locating 
and  counting  tasks.  It  was  not  for  identifying  tasks. 
Subjective  preference  was  for  color  code  across  all  tasks 
(Chrlstner  and  Ray,  1961). 

Hitt,  1961,  analyzed  the  relative  effectiveness  of 
using  selected  abstract  coding  methods  on  operator 
performance  when  engaged  In  various  tasks.  Operator 
performance  was  measured  using  number  of  correct  responses 
and  speed  of  response  per  experimental  condition.  Two 
sets  of  5  subjects  were  Instructed  to  complete  five  map 
reading  tasks  (Identify,  locate,  count,  compare  and 
verify)  as  rapidly  and  accurately  as  possible.  The  codes 
were  color,  numerals,  letters  and  shapes.  To  Investigate 
both  performance  measures,  time  and  accuracy  were  combined 


in  the  form  of  number  of  correct  responses/minute.  Based 
on  an  analysis  of  variance  of  a  fixed  effect  model  on  the 
mean  correct  response/minute  scores,  Hitt  concluded  that 
color  and  numerals  were  the  two  superior  coding  methods  in 
this  reading  situation.  He  concurred  with  Christner  and 
Ray  that  numerical  coding  was  more  effective  in 
identification  type  tasks  while  color  was  more  effective 
for  the  locate,  count,  compare  and  verify  tasks  (Hitt, 
1961;  Jones,  1962). 

Newman  and  Davis,  1961,  concluded  that  color  coding 
reduced  response  time  in  both  locating  and  decoding  tasks 
(Cook,  1974). 

Schutz,  1961,  studied  the  effect  of  color  and 
black/white  coding  when  subjects  were  engaged  in  point 
reading  and  comparing  tasks.  Red,  yellow,  green  and 
purple  were  the  colors  used.  The  first  task  consisted  of 
reading  vert i ca 1 -ax i s  values  from  horizontal  axis  values 
for  a  given  coded  line.  The  comparing  task  consisted  of 
determining  the  coded  line  with  the  highest  value  on  the 
vertical  axis  for  specified  values  on  the  horizontal  axis. 
Performance  of  ten  subjects  was  measured  as  time  to 
complete  the  task.  Errors  were  few  and  random.  Based  on 
his  analysis  of  variance  on  the  mean  performance  time 
scores,  Schutz  concluded  the  color  only  slightly  improved 


the  rate  for  point  reading  but  not  for  comparing  (Schutz, 
1961 ) . 

Jones,  1962,  recommended  the  task  be  the  criteria 
upon  which  the  decision  be  based  whether  to  use  color 
coding.  Results  from  his  analysis  suggested  color  codes 
are  not  suited  for  tasks  that  demand  rapid  and  precise 
identification  but  are  valuable  in  decreasing  search  time 
In  a  locating  task  (Barker  and  Krebs,  1977). 

Smith  published  his  research  on  the  effects  of  color 
on  speed  and  accuracy  over  various  tasks  from  1962  through 
1965.  His  experimental  designs  and  dependent  variables 
were  similar  across  all  his  studies. 

Smith,  1962,  had  12  subjects  do  300  visual  searches 
on  12  inch  square  slides  that  were  placed  18  inches  away. 
His  analysis  concluded  color  resulted  in  faster  and  more 
accurate  searches  when  the  target  color  was  known 
beforehand.  Neither  the  particular  color  of  target  or 
background  had  any  significant  effect  on  performance 
(Barker  and  Krebs,  1977;  Cook,  1974;  Smith,  1962). 

Smith,  1963,  concluded  that  color  coding  resulted  in 
65%  reduction  In  time  for  visual  search,  697,  reduction  in 
time  for  counting  and  767.  reduction  in  counting  errors. 
His  findings  were  based  on  data  collected  on  12  subjects 
(Barker  and  Krebs,  1977;  Cook,  1974). 


Smith  and  Thomas,  1964,  had  8  subjects  engage  in  a 
counting  task.  The  display  device  was  a  29  fnch  square 
which  was  placed  five  feet  away  from  the  subject.  They 


cone  1 uded 

color 

cod i ng 

resulted 

i  n 

faster  performance 

times  and 

fewer 

errors 

(Barker 

and 

Krebs,  1977;  Cook, 

1974) . 

Smith,  Farquhar  and  Thomas,  1965,  designed  an 
experiment  that  assessed  and  compared  the  effects  of 
symbolic,  numeric  and  color  coding  in  formatted  displays. 
Row  comparison  and  item  counting  were  the  tasks  performed 
by  their  12  subjects.  Based  on  analysis  of  their  mean 
time  scores  and  errors  they  concluded  that  color  coding 
was  sign  if leant ly  superior  to  any  other  coding.  Color 
coding  reduced  performance  time  over  no  coding  by  727.  for 
the  item  counting  task  and  477.  for  the  row  comparison 
task.  Their  overall  generalization  was  the  effect  of 

color  depends  upon  the  Joint  interaction  of  the  format  of 
the  display  and  the  task  to  which  it  is  applied  (Barker 
and  Krebs,  1977;  Cook,  1974;  Smith,  Farquhar  and  Thomas. 
1965) . 

Hoisman,  Hannah,  and  Schaef,  1963,  were  under 
contract  with  the  Air  Force  (AF)  to  provide  an 
experimental  investigation  of  visual  displays  which  would 
lead  to  the  specification  of  information  presentation 
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parameters  associated  with  the  display  of  processed 
information  to  AF  intelligence  officers.  After  completing 
a  thorough  search  of  the  literature  Hoisman  et  al 
discovered  there  was  a  gap  in  knowledge  concerning  visual 
display  presentations  and  their  associated  parameters. 
They  designed  a  project  to  gain  information  that  would 
narrow  this  gap.  This  project  was  designed  in  the  context 
of  the  tasks,  presentations,  parameters  and  mission  of  the 
AF  Intelligence  System. 

Initially  Hoisman  et  al  determined  through  factor 
analysis  of  data  from  surveys  and  interviews  the  display 
type,  the  tasks  and  parameters  most  representat i ve  of  the 
Intelligence  community.  Map  displays  were  the  most 
utilized.  Six  tasks  were  identified  when  using  a  map: 
locate  (i.e.  How  many  cities?),  identify  (i.e.  Is  this 
target  an  air  base?),  extract  (i.e.  What  is  the  city's 
population?),  integrate  (i.e.  What  is  the  target  value  of 
this  city?),  correlate  (.e.  Is  the  strength  of  these  two 
bases  equal?)  and  analyze  (i.e.  What  are  the  rank  order  of 
the  targets?).  Density  and  color  coding  were  determined 
to  be  the  major  display  parameters  as  they  appeared  to 
contribute  a  large  portion  of  the  within  display 
differences  In  performance. 

Using  hardcopy  maps  as  the  display  device. 


experiments  were  developed  to  investigate  gross 

qualitative  indices  of  density  and  the  effects  of  color 
coding  on  subject  performance  when  completing  the  six 
categories  of  tasks.  Four  tentative  hypotheses  were 
proposed,  with  one  of  particular  interest  to  this  study: 
Color  coding  enhances  performance. 

A  pilot  study  was  conducted  to  pretest  the 
experiment.  Volunteers  were  43  males  of  freshman  or 
sophomore  academic  standing  enrolled  in  a  psychology 
course  at  the  University  of  Pittsburgh.  They  were  paid  a 
minimum  hourly  wage.  These  males,  ranging  from  18  to  20 
years  old,  all  passed  the  Dvorine  Psuedo-Isochromatic  test 
for  color  vision  prior  to  the  experiment.  After  a  20 
minute  lecture  familiarizing  the  subjects  with  the  task, 
and  a  practice  session,  each  was  given  an  envelope 
containing  maps  with  a  legend  and  a  list  of  tasks 
(questions)  to  be  completed.  Responses  to  the  questions, 
all  which  had  answers,  were  made  by  writing  on  paper 
visible  through  an  open  window  of  a  mechanical  instrument 
located  in  front  of  them.  This  instrument  insured  no 
answer  could  be  changed.  It  also  recorded  the  time  per 
task  completion.  The  maps  used  were  generic  maps  utilized 
by  the  AF,  altered  by  removing  the  geographic  names  and 
replacing  them  with  randomized  letter  codes.  This 
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controlled  for  any  familiarization  of  the  maps  by  the 
subjects.  The  effects  of  order  presentation  were 

considered  negligible  since  the  materials  and  the  tasks 
were  distributed  within  each  package  in  random  order.  The 
effects  of  location  of  significant  data  on  the  maps  were 
controlled  by  completely  randomizing  the  location  points. 
The  dependent  variables  or  measures  of  performance  were 
speed  and  accuracy.  Speed  was  defined  as  the  mean 
response  time  per  task  (quest ion) .  Accuracy  was  measured 
as  total  number  of  correct  tasks  completed.  The 

independent  variables  were  color,  contrast,  task  and  task 
I  eve l  . 

Using  a  two  factor  experimental  design  and  analysis 
of  variance,  Hoisman  et  al  reported  their  data  supported 
their  hypothesis  that  color  tended  to  reduce  the  effect  of 
high  density.  Correlation  analysis  of  the  data  suggested 
no  significant  correlation  between  speed  and  accuracy. 

Based  on  their  findings  from  their  pilot  study, 
Hoisman  et  al  performed  their  main  experiment  to  evaluate 
the  effects  of  color,  density  and  task  on  the  performance 
of  AF  intelligence  analysts  using  trend  graphic, 
cartographic  and  tabular  displays.  The  volunteers  were  68 
male  college  students  who  were  paid  a  minimum  hours  wage 
to  participate  In  five  3-hour  sessions.  They  were  given 
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the  incentive  of  being  paid  extra  if  chey  performed  better 
than  par. 

The  experiment  was  given  in  a  windowiess  classroom 
illuminated  by  fluorescent  tubes  to  40  footcandles.  Each 
worked  at  a  24x48  inch  flat  top  28  inch  high  desk.  As  in 
the  pilot  study  a  pretest  training  and  practice  session 
was  given.  Each  subject  was  then  given  a  package 
containing  tasks  (3  simple,  3  complex  questions)  and  maps. 
Each  worked  at  his  own  pace. 

The  Independent  variables  were  density  (two 
categories  of  5  levels  each),  color  (chromatic  (red,  blue, 
purple,  shaded  grey)  and  black/white)  and  tasks  (6  types 
previously  defined).  The  dependent  variables  were 

accuracy  and  speed  item  by  item  and  in  total. 

The  underlying  assumptions  stated  were  the  following. 
Their  subjects  did  not  differ  significantly  in  any  of 
their  essential  characteristics  from  AF  Intelligence 
analysts  as  a  group.  The  testing  materials  used  and  the 
principles  followed  in  their  design  were  meaningfully 
related  to  the  actual  materials  used  by  the  analysts. 
Finally  the  tasks  used  in  testing  the  hypotheses  and  the 
methods  used  in  deriving  them  related  to  the  tasks 
performed  by  the  analysts. 

Using  a  complete  5x5x2x6  factorial  design,  a  standard 


analysis  of  variance  technique  was  employed  to  examine 
both  the  major  and  interactive  effects  of  each  of  the  18 
variables.  Standard  product  moment  correlation  analysis 
Indicated  a  nonsignificant  relationship  between  accuracy 
and  speed. 

Hoisman  et  al's  major  conclusions  were  several.  Of 
prime  Interest  to  this  research  is  their  stated  conclusion 
the 

the  addition  of  a  color  as  an  enhancement  of  a 
symbol's  di sti nguishabi 1 ity  does  not  Increase 
performance  and  may  even  lessen  it  (Hoisman  et 
al . ,  1963,  p.  7) 

Color  here  was  added  coding  (redundant)  that  was  used 
with,  not  Instead  of,  shape  and  size  codes. 

Brooks,  1965,  studied  the  effects  of  using  different 
redundant  color  codes  in  a  search  task.  He  varied  the 
colors  and  the  number  of  colors  used.  Coding  consisted  of 
adding  a  bar  of  color  beneath  the  target.  His  conclusions 
were  based  on  data  col lected  from  six  groups  of  10 
subjects  each.  All  subjects  were  Informed  in  advance  their 
particular  color  coding  condition.  Color  resulted  in 
significantly  better  search  times  than  the  no  color 
condition.  Neither  the  number  of  colors,  the  particular 
color,  the  display  background  used,  nor  any  interaction  of 
these  variables  had  any  significant  affect  on  performance 
time  (Barker  and  Krebs,  1977;  Brooks,  1965;  Cook,  1974). 


Munns,  1967,  was  Interested  In  the  effects  of  display 
symbol  variation  upon  operator  performance  In  aircraft 
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displays 
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color 
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color  (red  and 

blue) ) 

cod 1 ng 

of 

symbols  were  used 

as  1 ndependent 

variables.  All  combinations  were  given  to  each  of  12 
Lehigh  University  male  subjects.  Their  task  was  to  detect 
and  discriminate  enemy  and  friendly  aircraft  presented  on 
an  8  x  10  problem  sheet  Inserted  In  a  box  that  was  used  to 
simulate  a  radar  display.  Every  effort  was  made  to  reduce 
the  subject's  decision  process  to  the  simplest  level. 
Time  to  complete  the  task  and  number  of  errors  (enemy 
aircraft  missed)  were  Munn's  dependent  variables.  Using 
the  results  from  a  2  way  analysis  of  variance  he  concluded 
that  color  reduced  performance  time  and  made  the  subjects 
feel  more  secure  but  there  was  no  Indication  color  reduced 
subject  errors.  Display  density  or  the  interaction 
between  density  and  color  were  not  significant  (Munns, 
1 968 )  . 

Lee,  1969,  assessed  the  comparative  effectiveness  of 
colors  In  a  target  search  task.  Thirty-six  subjects  were 
timed  on  their  search  and  count  tasks  of  targets  displayed 
on  a  color  CRT.  Color  was  superior  to  black/white  coding 
for  both  tasks  (Barker  and  Krebs,  1977). 


Dooley  and  Harkins,  1970,  designed  an  experiment  with 
the  purpose  of  examining  the  effect  of  color  on  attention 
and  recall.  A  standard  flipchart  was  their  display 
device.  On  It  color  (treatment)  was  used  In  three  ways: 
functional,  attent lon-gett ?ng  and  monochrome.  Functional 
color  was  defined  as  employing  color  as  the  primary 
information  code.  If  the  color  was  removed  the 
communication  was  degraded.  Attention-getting  color  was 
defined  as  using  color  to  attract  attention.  If  the  color 
was  removed  the  communication  was  not  degraded. 
Monochrome  color  was  defined  as  the  void  of  color  or  the 
use  of  black/white  shadings  as  the  communication  medium. 
Dooley  and  Harkins'  performance  measures  were  scores 
derived  from  a  post  treatment  written  test  on  recall,  and 
the  number  and  amount  of  time  the  subjects  looked  at  the 
flipchart.  As  far  as  the  subjects  knew  the  flipchart  was 
irrelevant  to  the  experiment  In  which  they  were  engaged. 
It  Just  happened  to  be  standing  In  the  corner  of  the 
experimental  room. 

Twenty-three  men  and  22  women  employees  of  Xerox 
Corporation  were  randomly  assigned  to  one  of  the  treatment 
conditions.  The  flipchart  contained  a  71  x  97  cm  chart 
displaying  bar  graphs  of  comparative  Information  of  three 
companies  on  three  different  topics.  Xerox  was  one  of  the 


companies.  The  legend  and  bar  graphs  were  coded  as 
monochrome  (black/whf te/shadlng) ,  functionally  colored 
(red  green  blue)  or  attention  getting  colored  (monochrome 
conf iguration  with  colored  borders).  When  the  subject 
entered  the  room  to  participate  in  another  study  the 
treatment  was  mounted  on  a  flipchart  stand  in  a  corner 
approximately  30  degrees  to  one  side  of  the  subject.  It 
was  not  mentioned  by  the  experimenter  who  was  evasive  If 
asked  about  It  by  the  subject.  The  experimenter  left  the 
room  on  two  occasions  for  2.5  and  l  minute  respectively 
for  valid  reasons. 

Dooley  and  Harkins  used  a  camera  to  record  number  and 
length  of  times  subject  spent  looking  at  the  flipchart. 
These  data  were  then  order  ranked  for  all  part i c i pants . 
Measure  of  attention  was  the  sum  of  the  rankings  of  the 
two  recorded  indexes.  A  Kruskal -Wal 1 1 s  test  on  the  mean 

attention  ranks  showed  the  three  treatments  to  be 
different  though  a  subsequent  chi-square  test  indicated  a 
significant  difference  only  between  color  and  monochrome. 
From  this  Dooley  and  Harkins  concluded  more  attention  was 
given  to  the  colored  charts. 

At  the  conclusion  of  the  experiment  the  flipchart  was 
covered  and  each  subject  was  asked  to  complete  a  written 
test  whose  questions  were  based  on  the  flipchart's 
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Information.  The  questions  ranged  from  general  to 

specific,  each  with  a  score  of  one  point  if  answered 

correctly.  Measure  of  recal 1  was  the  sum  score.  The 

subjects'  overall  scores  were  then  ranked  and  analyzed 

using  a  Kruskal-Wal 1  is  one-way  analysts  of  variance  test. 

There  was  no  evidence  to  support  the  hypothesis  that  color 

had  an  effect  on  recall.  From  this  they  concluded 

it  is  Improper  to  assume  that  the  addition  of 
color  to  graphic  communications  will  Increase 
their  effectiveness  whether  the  color  is  used 
functionally  or  decorat  1 vely. . .There  are 
undoubtedly  applications  In  graphic 

communications  where  the  addition  of  color  would 
have  a  clear  beneficial  effect  over  that 
expected  for  the  best  available  black  and  white 
code,  but  the  present  example  is  not  one  of 
those  cases.  Since  information  on  the 
effectiveness  of  color  In  graphic  communications 
has  substantial  practical  significance,  more 
research  in  this  area  is  clearly  needed  (Dooley 
and  Harkins,  1970,  p.  854). 

Beyer,  Schenk  and  Z let  low,  1971,  compared  the 
effectiveness  of  color  coding  to  that  of  brightness  coding 
when  using  the  CRT  as  the  display  device.  Effectiveness 
was  measured  as  task  response  time  and  subject  preference. 
Sixty-eight  subjects  were  involved  In  four  experimental 
tasks:  locating,  tracking,  identifying  and  giving 

subjective  evaluation.  Using  statistical  analysis 
techniques,  response  time  was  significantly  faster  for  the 
color  coding  for  the  first  three  tasks.  However  Beyer  et 
al .  comment  the  difference  was  probably  not  of  practical 


4 

I 

importance.  Subjective  preference  favored  the  color  coded 
displays  (Barker  and  Krebs,  1975). 

Kanarick  and  Petersen,  1971,  recognized  that  few 
studies  had  examined  the  effects  of  redundant  coding, 
particularly  color,  on  retention  when  the  pay  off  ratios 
were  varied.  Payoff  ratios  x:y  were  defined  as  the  number 
of  points  awarded  for  a  correct  response/column.  For  half 
of  the  columns  x  points  were  scored  for  a  correct 
response.  For  the  other  half,  y  points  were  scored.  The 
subjects  were  Informed  prior  to  the  experiment  the  point 
value  of  each  column.  Using  a  mixed  factorial  design  with 
the  coding  condition  being  the  between  subjects  variable, 
they  analyzed  the  effect  of  using  or  adding  color  to  a 
code.  The  task  used  was  one  of  remembering  how  many  times 
a  particular  code  was  presented  In  a  particular  column. 
They  had  ten  columns  (A-J),  and  six  codes.  The  codes  were 
three  types:  numbers  (1-6),  colors  (blue,  violet,  yellow, 
red,  orange,  green)  or  uniquely  colored  numbers  (blue  1, 
violet  2,  etc.).  Three  treatment  sessions  were 
administered:  training,  4:1  and  1:1.  The  training  session 

familiarized  each  of  the  56  paid  undergraduate  students 
with  the  task.  During  the  4:1  payoff  session,  the 
subjects  were  told  they  would  receive  four  points  for  a 


correct  response  In  specified  columns  and  one  point  for  a 
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correct  response  in  the  other  half.  The  third  session  was 
the  -same  as  the  second  except  one  point  was  gfven  to  all 
correct  responses.  Subjects  competed  for  monetary  bonuses 
based  on  total  points.  Per  cent  correct  response  was  the 
measure  of  performance. 

Based  on  analysis  of  variance  and  Duncan's  New 
Multiple-Range  Test,  Kanarick  and  Petersen  concluded  that 
redundant  color  coding  of  numerical  information  did  not 
facilitate  performance.  Subjects  were  more  attentive  to 
the  point  value  of  the  column  and  not  the  color  when  the 
payoff  was  4:1.  Redundant  coloring  did  not  enhance 
performance  with  the  1:1  payoff  condition  (Kanarick  and 
Petersen,  1971). 

Shontz,  Trumm  and  Williams,  1971,  chose  aeronautical 
charts  as  the  display  device  in  their  examination  of  the 
effects  of  color  on  performance  when  searching  for 
geographic  checkpoints.  The  ground/map  coordination  task 
was  similar  to  that  of  a  pilot  who  had  to  locate  his 
position  after  being  geographically  disoriented. 
Cumulative  search  time  was  the  measure  of  performance. 
The  independent  variables  of  interest  were  coding  (uncoded 
chromatic,  coded  chromatic,  coded  achromatic),  clutter  and 
code  size. 

To  gather  their  data,  Shontz  et  al .  had  33  Junior  and 


senior  Air  Force  Reserve  Officer  Training  Cadets  who  were 


currently  enrolled  in  pilot  and  navigator  training  locate 
48  checkpoints  on  a  14  x  14  inch  map.  They  were  seated  in 
a  modified  tilted  dental  chair  at  a  viewing  distance  of  20 
inches.  Shontz  et  al.  Identified  the  28  color  codes  in 
Munsell  Book  Notation.  A  mixed  design  was  used  with  map 
coding  as  between-subject  variable  and  background  clutter 
and  code-alphabet  size  as  within  subject  variables. 

|  Results  from  the  Kol mogorov-Smi rnov  two  sample  test 

indicated  differences  between  distributions  of  cumulative 
search  times  was  significant  between  those  using  color 

i 

I  coded  maps  and  uncoded  maps.  Location  of  checkpoints  was 

faster  for  those  having  color  coded  maps.  However  the 
difference  was  not  significant  when  comparing  coded 
I  chromatic  and  coded  achromatic  conditions,  despite  the 

fact  those  viewing  achromatic  maps  reported  difficulty 
distinguishing  between  roads  and  rivers  of  similar 

configuration.  Based  on  this  finding  Shontz  et  al. 
question  whether  properly  designed  and  coded  achromatic 
maps  would  Improve  performance  sufficiently  under  varying 
conditions  to  warrant  their  consideration  (Shontz,  Trumm 
and  Will  lams,  1971 ) . 

Markoff,  1972,  varied  the  resolution  in  his  study  on 
the  Impact  color  had  on  target  recognition.  Analysis  of 


the  data  on  48  subjects  indicated  that  chromatic  real 
world  imagery  resulted  in  significantly  faster  target 
identification  times  than  did  comparable  achromatic 
screens.  This  effect  was  more  pronounced  as  the 
resolution  decreased  (Barker  and  Krebs,  1977). 

Wedel 1  and  Alden,  1973,  tested  the  hypothesis  that 
color  was  superior  to  numeric  coding  in  a  modified  keeping 
track  task  particularly  as  the  total  number  of  items 
displayed  Is  Increased.  Accuracy  was  the  measure  of 
performance  for  the  36  male  subjects  who  simulated  the 
task  of  an  air  traffic  controller  who  had  to  detect 
changes  from  display  to  display.  The  colors  used  were 
blue,  purple,  red,  orange,  yellow  and  green.  Based  on 
analysis  of  variance  analysis  on  error  type  they  concluded 
color  can  aid  in  retaining  information  concerning  number 
of  items  presented  and  their  location  but  does  not  help  in 
identification  (Wedel 1  and  Alden,  1973). 

Saenz  and  Riche,  1974,  conducted  a  study  that 
examined  the  effects  on  search  time  when  shape  and  color 
codes  were  used  separately  and  redundantly.  Target  and 
backgrounds  were  varied  by  color  and  shape.  The  three 
independent  variables  were  color  code,  shape  code  and 
color  shape  (redundant  code).  The  colors  were  identified 
by  Colormatch  Paper  and  Include  red,  yellow-red,  yellow. 
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green-yellow,  green,  blue-green  and  blue.  The  shapes  used 
were  circle,  hexagon,  triangle,  cross,  diamond,  star  and 
square.  The  dependent  variable  was  time  required  for  each 


of  the 

24 

male 

undergraduate  students 

to  locate 

and 

respond 

to 

six 

targets  in  each  of  48 

settings. 

The 

display  device  was  a  thick  white  illustration  board  that 
contained  the  target  and  background  stimuli  which  was 
inserted  in  a  tilted  box  for  ease  of  visual  access.  All 
students  were  required  to  pass  the  Dvorine 
Pseudo- I sochromat 1c  test  of  color  vision. 

Saenz  and  Riche  performed  an  analysis  of  variance  on 
the  logarithmic  transform  of  search  time  in  seconds  and 
applied  Tukey's  HSD  test  for  pairwise  comparisons  among 
the  means  of  main  effects.  Redundant  and  color  codes  were 
found  to  be  equally  effective  and  more  so  than  shape  code. 
The  authors  suggested  that  although  redundancy  may  not 
degrade  operator's  response,  it  may  not  help  it  either. 
Therefore  cost  should  be  a  factor  in  the  design  of  the 
code  for  display  (Saenz  and  Riche,  1974). 

Cahill  and  Carter,  1976,  researched  the  Impact  of 
display  density  and  color  code  size  in  a  search  task. 
Twenty  subjects  were  asked  to  search  for  three  digit 
numbers  on  displays  ranging  In  density  from  10  to  50  items 
coded  in  1  through  10  colors.  Based  on  their  analysis  of 
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search  time,  they  recommend  seven  colors  as  the  optimal 
number  of  colors  to  be  employed.  Fewer  and  more  colors 
|  increased  search  time  (Barker  and  Krebs,  1977). 

Keister,  1981,  was  one  of  the  first  to  investigate 
and  report  on  the  effects  of  color  coding  in  other  than 
the  traditional  search,  locate  and/or  identify  type  tasks 
that  usually  concentrated  on  random  patterns  of  letters, 
numbers,  geometric  shapes  and  color.  Keister's 
contribution  involved  the  organized  task  of  data  entry  at 
two  levels  of  complexity:  adding  and  changing  an 
inventory  item.  Time/transaction  and  errors  were  the 
measures  of  performance.  Two  laboratory  experiments  were 
designed  to  ascertain  the  effects  of  color  coding  on 
operator  performance  when  entering  or  changing  product 
information  in  an  inventory  data  base.  The  mode  of  visual 
display  was  a  color  or  green  phosphorus  CRT  with  keyboard 
interface.  Data  format  was  color  or  monochrome  tabular 
text.  All  subjects  were  experienced  in  typing  and  data 
entry  but  not  in  the  specific  task. 

The  first  experiment  involved  having  the  eight  paid 
female  subjects  enter  data  in  two  sessions  and  change  it 
in  two  other  sessions.  Each  subject  used  both  codes 
(monochrome  and  color)  for  each  task,  presented  in  random 
order.  The  authors  reported  the  overall  results  showed 


data  entry  of  a  new  Item  was  somewhat  more  rapid  when 
using  the  color  code.  The  data  indicated  that  color 
accounted  for  even  greater  speed  for  the  change  task.  The 
increase  was  about  two  seconds/entry  which  was  not 
statistically  significant.  After  analyzing  the  order  of 
presentation  the  authors  suggested  the  data  supported  the 
conclusion  that  color  facilitates  learning  and  offers  a 
permanent  advantage  1 n  speed  for  those  who  cont 1 nua 1 1 y 
switch  data  entry  tasks. 

In  their  second  experiment  54  National  Cash  Register 
(NCR)  software  development  personnel  were  asked  to 
complete  the  data  change  task  as  quickly  and  accurately  as 
possible.  Green  phosphorus,  compatible  color  and 

conflicting  color  presentat Ions  were  used  by  the  subjects 
during  their  24  transactions.  Color  usage  was  compatible 
across  tasks  If  a  particular  color  always  symbolized  the 
same  information  (l.e.  yellow  always  denoted  price). 
Color  usage  was  conflicting  If  a  particular  color 
symbolized  different  Information  across  tasks  (i.e.  yellow 
denoted  price  In  one  task  and  Inventory  quantity  in 
another).  Density  was  varied  by  number  of  products  listed 
on  screen:  one,  two  or  four.  As  In  the  their  first 

experiment,  the  overall  effect  of  color  on  total  time  was 
not  significant.  However  the  authors  suggested  that  color 
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was  superior  when  the  subject  was  using  the  most  complex 
screen  which  contained  four  products. 

Keister  concluded  the  overall  performance  on  color 
displays  was  superior  to  that  on  monochrome  displays 
despite  the  lack  of  statistical  significance.  Based  on 
his  reports  that  color  was  superior  in  all  experiments, 
Keister  suggested  a  motivational  interpretation.  Color 
can  increase  operator  motivation  and  facilitate 
performance  by  making  the  task  more  interesting.  In 
moderately  complex  data  entry  task  (locating  an  item  and 
keeping  track  of  entry  activities)  color  provided  only  a 
minimal  advantage.  In  a  more  complex  task  (which  required 
the  entry  of  a  number  before  making  a  change),  color  aided 
in  the  locating  and  keeping-track  tasks.  Color  also 
facilitated  the  switching  among  different  type  tasks. 
These  results  were  based  on  mean  times  of  performance  that 
were  not  statistically  significant.  The  author  did  not 
indicate  how  subjective  opinion  was  obtained  from  the 
participants  though  he  purports  color  added  to  motivation 
and  decreased  boredom. 

Tullis,  1981,  also  designed  a  study  that  used  other 
than  the  traditional  search,  locate  and/or  identify  type 
tasks.  He  described  his  task  of  diagnosing  the  condition 
of  telephone  lines  as  a  typical  example  of  a  large  class 


of  computei — aided  decision-making  tasks.  Subjects  had  to 
integrate  various  classes  of  data  in  order  to  make  a 
timely  decision  about  the  status  of  some  real  world 
system.  His  setting  was  a  day  training  session  for  eight 
Bell  Laboratory  employees  who  had  experience  in  the  Bell 
System  ranging  from  4  to  27  years  though  not  in  the 
particular  tasks  used  in  the  experiment.  The  display 
devices  were  four  types  of  formats  presented  on  a  color 
graphic  CRT:  narrative  (complete  words,  and  phrases), 
structured  (fixed  tabular  form),  black  and  white  graphics 
(structured  plus  schematic  and  shape  coding  for  condition 
of  telephone  lines),  and  color  graphics  (black  and  white 
graphics  format  with  color  coding  substituted  for  shape 
coding  for  condition).  The  colors  used  were  consistent 
with  the  population  stereotypes:  green  for  good,  red  for 
bad  and  ye  1  low  for  marg Inal.  The  day  was  d i v i ded  i nto  1 0 
training  sessions.  The  first  six  sessions  were  text-based 
that  familiarized  the  subjects  with  the  electrical 
characteristics  of  telephone  lines.  The  last  four 
sessions  were  CRT  based  and  were  designed  to  acquaint  the 
subject  with  one  format/session  and  then  test  the  subject 
using  two  sets  of  37  different  displays  accompanied  by  a 
24  question  multiple  choice  test.  Subjects  completed  a 
format  session  after  they  achieved  at  least  907.  accuracy. 


They  then  proceeded  on  to  the  next  session.  The  questions 
ranged  from  simple  ident i f i cat  ion  to  complex  integration 
and  decision  making.  Speed,  accuracy  and  number  of 
training  sessions  needed  were  the  measures  of  performance. 
A  subjective  questionnaire  evaluating  the  four  formats  was 
administered  at  the  day's  end. 

To  control  for  practice  effects  Tull  is  used  a  4  x  4 
Latin  Square  design  to  determine  the  order  of  presentation 
of  format/ sub j ect .  Analysis  of  variance  of  data  indicated 
that  accuracy  did  not  vary  with  format.  The  80%  training 
accuracy  criterion  may  have  eliminated  any  significance. 
However  response  time  was  affected  by  format.  Comparisons 
of  means  using  Tukey's  HSD  test  suggested  the  response 
time  when  using  the  graphics  format  was  significantly 
shorter  than  when  using  the  narrative.  There  were  no 
significant  differences  in  response  times  between  the  two 
graphics  conditions  of  color  and  black/white.  Fewer 
training  sessions  were  required  to  achieve  the  required 
accuracy  for  the  graphics  over  the  narrative  format. 

The  only  difference  between  the  two  graphics  formats 
arose  in  the  subjective  ratings.  Seven  of  the  eight 
subjects  indicated  they  preferred  color  over  black/white 
and  found  the  color  format  aesthetically  pleasing  (Tull  is. 


Boeing,  1982,  completed  a  study  whose  purpose  was  to 

evaluate  the  CRT  pictorial  formats  and  their  effect i veness 

in  presenting  integrated  information  to  the  pilot  (Scott, 

1983).  One  of  the  issues  under  investigation  was  how  the 

color  formats  compared  to  those  in  monochrome.  Their 

analysis  was  based  on  data  collected  on  18  experienced 

military  pilots  who  flew  two  20  minute  simulated 

air-to-air  and  all — to-ground  composite  missions.  Their 

cockpits  were  outfitted  with  CRT  instruments.  One  of  the 

display  conf igurat ions  was  monochrome,  similar  to  the 

current  generation  aircraft  CRT  display.  A  second  was  a 

color  vector  graphics  display  which  produced  stick-like 

figures.  The  third  configuration  used  a  color  raster  scan 

system  which  created  solid  color  drawings. 

The  subjective  and  objective  results  differed.  The 

pilots  overwhelmingly  preferred  the  color  over  the 

monochrome  configuration.  However,  the  objective  data  on 

performance,  measured  as  speed  in  interpreting  and 

responding  to  display  information,  did  not  contain 

sufficient  evidence  to  report  any  differences  between  the 

three  configurations.  Mr.  Reising,  from  the  Flight 

Dynamics  Laboratory,  commented  the  study  did  show 

that  color  pictorial  displays  probably  ease  the 
pilot's  Information  assimilation  task  when 
viewing  a  complex  display. . .although  more 
studies  are  required  (Scott,  1983,  p.  232). 


Christ,  Corso  and  Teichner,  1975,  1977  and  1983  are 
presently  the  key  figures  In  the  field  of  study  of  the 
effects  of  color  on  human  performance  when  used  In  a 
display  for  Information  transfer.  All  of  their  studies 
here  were  supported  under  contract  by  the  Office  of  Naval 
Research  with  the  purpose  of  evaluating  the  basis  of 
possible  design  recommendat ions  for  or  against  the  use  of 
color  In  aircraft  displays.  Their  first  step  was  a 
detailed  search  of  the  literature  and  analysis  of  findings 
(Christ,  1975).  Based  on  this  analysis  and  noted  glaring 
gaps  in  knowledge,  experiments  were  designed  and  carried 
through  In  attempts  to  narrow  some  of  these  gaps. 
Experience  of  operator  and  level  of  task  were  their  first 
concerns.  Previous  studies  had  unskilled  subjects  doing 
simple  tasks.  Christ  and  Corso  (1975)  trained  their 
subjects  for  almost  a  year  on  tasks  of  varying  difficulty 


before 

collecting  data. 

In 

1977 

Chr 1 st. 

Corso 

and 

Teichner 

expanded  their 

study 

to 

1 ncl ude 

exper 1 enced 

operator 

performance  1 n 

a 

more 

rea  1 

world 

type 

environment.  Christ  had  the  results  of  all  their  studies 
published  in  1983  with  the  conclusion  that 
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the  relative  effectiveness  of  different  visual 
display  codes  depends  upon  other  display 
conditions,  the  task,  and  the  dependent  measure 
used  to  make  the  comparison  (Christ  and  Corso, 

1983,  p.  83). 

This  paper's  literature  review  concludes  with  a 
summary  table  of  the  experimental  research  previously 
discussed  (Table  2.1,  Summary  of  Experimental  Research  on 
the  Effects  of  Color)  and  Christ  et  al.'s  works.  These 
authors'  contributions  serve  as  a  summary  not  only  because 
of  their  recency  but  because  of  the  contents.  Their 
thorough  review  and  analysis  of  the  literature  took  into 
account  all  studies  reported  above  except  six.  The  works 
of  Hoisman  et  al.,  1963,  Lee,  1969,  Beyer  et  al.,  1971, 
Cahill  and  Carter,  1976  and  Keister,  1981,  Boeing,  1982 
were  not  Included  either  because  of  data  requirements  or 
date  of  pub 1 1  cat  1  on . 

Table  2.1,  Summary  of  Experimental  Research  on  the 
Effects  of  Color  by  Author  and  Date,  presents  a  concise 
skeleton  outline  of  the  works  reviewed  in  this 
experimental  section.  As  in  the  narrative,  the 
researchers  are  listed  in  chronological  order.  The  tasks 
and  sample  size  from  which  their  results  were  generalized 
are  given  where  available.  The  dependent  variable  or 
performance  measure  is  stated.  The  Color  Significant 


entry  indicates  whether  their  results  showed  that  using 
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l 

•i 

I 

SUMMARY  "  J 

EXPERIMENTAL  RESEARCH  ] 

on  the 

EFFECTS  of  COLOR  In  VISUAL  DISPLAYS 

by  ! 

AUTHOR  and  DATE  _  ■ 


TASK 

No. 

PERFORMANCE 

COLOR 

RESEARCHER (S) 

DATE 

TYPE 

Sub. 

MEASURE 

SIGN  IF 

Green  et  a  1 . 

1953 

Search 

Time 

Y 

Green  Anderson 

1956 

Search 

Time 

Y 

Chr i stner 

1961 

Locate 

5 

Time 

Y* 

Ray 

Count 

5 

Time 

Y* 

Identify 

5 

Time 

N 

Hitt 

1961 

Locate 

10 

Accuracy/T i me 

Y 

Count 

10 

Accuracy/Time 

Y 

! dent  1 fy 

10 

Accuracy/Time 

N 

Compare 

10 

Accuracy/T i me 

Y 

Verify 

10 

Accuracy/Time 

Y 

Newman 

1961 

Locate 

Time 

Y 

Dav  i  s 

Decode 

Time 

Y 

Schutz 

1961 

Pt .  Read 

10 

T  ime 

Y 

Compare 

10 

Time 

N 

Jones 

1962 

Identify 

Time 

N 

Locate 

Time 

Y 

Smith 

1962 

Search 

12 

Time 

Y 

12 

Accuracy 

Y 

Smith 

1963 

Search 

12 

Time 

Y 

Count 

12 

Time 

Y 

12 

Accuracy 

Y 

Smith 

1964 

Count 

8 

Time 

Y 

Thomas 

8 

Accuracy 

Y 

Smith  Thomas 

1965 

Count 

12 

Time 

Y 

Farquhar 

Compare 

12 

Time 

Y 

Ho i sman  et  a  1 . 

1963 

I  dent  1 fy 

68 

Time 

N 

68 

Accuracy 

N 

Locate 

68 

Time 

N 

68 

Accuracy 

N 

Extract 

68 

Time 

N 

68 

Accuracy 

N 

Integrate 

68 

Time 

N 

68 

Accuracy 

N 

not  statistically  significant 
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Corre 1  ate 

68 

T 1  me 

N 

68 

Accuracy 

N 

Analyze 

68 

Time 

N 

68 

Accuracy 

N 

Brooks 

1965 

Search 

60 

Time 

Y 

hunns 

1967 

Srch/ Idnt 

12 

Time 

Y 

12 

Accuracy 

N 

12 

Subjective 

Y 

Lee 

1969 

Search 

36 

Time 

Y 

Count 

36 

Time 

Y 

Dooley 

1970 

Attent 1  on 

45 

Time 

Y 

Harkf  ns 

Recal  1 

45 

Accuracy 

N 

Beyer 

1971 

Locate 

68 

Time 

Y 

Schonk 

I dent i f y 

68 

Time 

Y 

2 1 et 1 ow 

Track 

68 

Time 

Y 

Kanar 1 ck 

1971 

Recal 1  4:1 

56 

Accuracy 

N 

Peterson 

Recal 1  1:1 

56 

Accuracy 

N 

Shontz  et  a  1 . 

1971 

Locate 

33 

Time 

N 

Markoff 

1972 

I dent i f y 

48 

Time 

Y 

Wedel 1 

1973 

Recal  1 

36 

Accuracy 

Y 

Alden 

I dent i f y 

36 

Accuracy 

N 

Saenz  Riche 

1974 

Locate 

24 

Time 

Y 

Cahill  Carter 

1976 

Search 

20 

Time/Density 

Y 

Kel ster 

1981 

Data  Entry 

8 

Time 

Y* 

Data  Entry 

54 

Time 

Y* 

Tul 1  is 

1981 

Diagnostic 

8 

T  ime 

N 

8 

Subjective 

Y 

Boe 1 ng 

1982 

Inter/Resp 

18 

Time 

N 

18 

Subject i ve 

Y 

Chr 1 st 

1975 

Identify 

8 

T  ime 

N 

Corso 

Srch/Locate  8 

Time 

Y 

Reca 1 1 

8 

Time 

N 

Telchner 

1977 

Comb  1 ned 

8 

Time 

N 

Chr 1 st 

Combined 

8 

Time 

N 

Corso 

8 

Accuracy 

N 

I ntegrate 

8 

Time 

N 

color  as  a  code  Improved  performance  over  other  achromatic 
codes.  Statistical  significance  was  the  criterion.  The 
method  of  analysis  is  not  shown  as  the  majority  of  the 
studies  reported  applying  the  fixed  or  mixed  effects 
analysis  of  variance  techniques.  However  it  was  noted 
that  the  assumptions  validating  the  use  of  the 
mathematical  model  were  seldom  mentioned  or  addressed  in 
any  of  the  studies'  publications. 

Christ,  1975,  searched  the  literature  from  1950 
through  1973  for  studies  involving  experimental  research 
on  the  effects  of  color  when  used  as  a  code  for 
information  in  visual  displays.  Forty-two  studies  were 
found  that  satisfied  the  following  five  criteria.  The 
dependent  measure  was  per  cent  correct,  information 
transmitted  or  response  time.  Second,  the  data  yielded  a 
comparison  between  chromatic  and  achromatic  codes  and  were 
available.  Third,  the  task  required  Information 
extraction  from  symbolic  or  pictorial  displays  rather  than 
from  direct  viewing.  Fourth,  subjects  were  normal  young 
adults.  Finally,  the  procedures  and  general  experimental 
design  were  acceptable. 

In  order  to  analyze  the  data  among  different  studies 
which  used  different  procedures  for  measuring  the  same 
dependent  variables,  Christ  normalized  the  scores  within 
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each  experiment  while  holding  the  other  task  parameters 
constant.  For  the  dependent  variable  accuracy,  the  score 
was  calculated  using 

%  Difference  in  Score  = 

Color  -  Achromatic  x  100  (1) 

Achromatic 

Search  time  was  calculated  using 

T.  Difference  in  Time  = 

Achromatic  -  Color  x  100  (2) 

Achromat i c 

Positive  values  for  either  score  indicate  an  advantage  for 
color,  negative  scores  a  disadvantage  for  color,  relative 
to  a  particular  achromatic  target  attribute. 

Christ  categorized  the  results  of  the  acceptable 
studies  by  the  way  color  was  used  on  a  target 
(unidtmensional ,  multidimensional,  interference, 

completely  redundant,  partially  redundant)  and  by  task 
type  (identification  and  search).  A  target  was 
uni dimensional  if  it  differed  from  others  in  terms  of  only 
one  stimulus  dimension  while  all  other  attributes  were 
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held  constant.  Multidimensional  targets  differed  from 
other  targets  In  more  than  one  attribute.  A  target  had 
Interference  If  Irrelevant  color  was  added  to  the  display 
which  Interfered  with  the  subject's  ability  to  locate  the 
achromatic  target.  Color  was  used  as  a  redundant  code  If 
when  added  to  the  target  It  provided  more  information  than 
was  minimally  necessary  for  It  to  be  Identified.  If 
knowing  the  color  of  a  target  was  necessary  but  not 
sufficient  to  Identify  It,  color  acted  as  a  partially 
redundant  code.  I  dent  If  1  cat  Ion  tasks  were  measured  in 
terms  of  accuracy;  search  tasks  In  terms  of  time. 

Table  2.2,  Range  of  Percent  Difference  Scores, 
summarizes  the  results  of  Christ's  analyses  based  on  the 
normalized  data,  the  above  definitions  of  the  use  of  color 
and  the  type  of  tasks  the  research  findings  to  date  had 
Investigated.  The  data  within  the  table  show  the  range  of 
ga1n(+)  or  loss  (-)  that  has  been  reported  with  the  use  of 
colors  as  target  codes  relative  to  the  Indicated 
achromatic  coding  dimensions.  These  gains  and  losses  are 
expressed  in  terms  of  the  per  cent  change  (Equations  1  or 
2)  relative  to  the  achromatic  codes  shown  In  the  first 
column.  The  range  Is  based  on  n  number  of  comparisons. 

Christ  offers  this  table  only  as  a  starting  point  to 
designers  considering  Integrating  the  attribute  of  color 
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Table  2.2 

RANGE  of  PERCENT  DIFFERENCE  SCORES 
for  the 
USE  of  COLOR 


Identification  Task  Search  Task 

M 1  n  Max  n  Min  Max  n 


Uni  dimensional 


Brightness 

+ 

29 

+  32 

2 

+43 

+43 

1 

Size 

- 

6 

+  1  1  1 

6 

+40 

+40 

1 

Geometric  Shape 

- 

38 

+  33 

1  1 

+  6 

+42 

5 

Other  Shapes 

0 

+  1  18 

6 

+30 

+63 

2 

Letters 

- 

29 

-  15 

6 

+  10 

+  7 

2 

Digits 

Mu  1 1 1 d 1  mens  1 ona 1 

48 

+  26 

17 

-  3 

+42 

4 

Size 

- 

10 

+  176 

7 

0 

Geometr 1 c  Shape 

- 

28 

+202 

15 

+50 

+53 

3 

Other  Shapes 

- 

2 

+  62 

12 

+41 

+69 

6 

Letters 

4- 

4 

+  46 

4 

0 

Digits 

Interference 

51 

+  19 

6 

0 

Size 

- 

29 

0 

14 

0 

Geometric  Shape 

- 

42 

+  1 

4 

-  8 

-  8 

1 

Other  Shapes 

- 

43 

-  17 

4 

-10 

-  3 

2 

Digits 

Complete  Redundancy 

14 

+  2 

7 

0 

Size 

+ 

22 

+  60 

3 

+33 

+32 

1 

Br ightness 

+  1 

04 

+  104 

1 

+32 

+32 

1 

Geometric  Shape 

0 

+21 

+32 

2 

Letters 

0 

+53 

+63 

2 

Digits 

Partial  Redundancy 

+ 

2 

+  2 

1 

+60 

+74 

3 

Digits 

0 

-23 

+73 

20 

Maps 

+ 

1 

+  1 

1 

0 

Static-Ground  Photo 

+ 

29 

+  29 

1 

+32 

+47 

1 

Static-Aerial  Photo 

+ 

2 

+  2 

1 

+  17 

+  17 

1 

Dynamic-Aerial  Photo 

+ 

3 

+  3 

t 

0 

Dynamic-Aerial  TV 

+ 

3 

+  3 

1 

-  3 

-  3 

1 

(Christ,  1975,  p  563) 
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into  their  information  display.  These  results  are  based 
solely  on  works  that  satisfied  Christ  et  al.'s  criteria 
for  their  study.  Their  purpose  centered  around  the 
recommendation  of  the  use  of  color  in  aircraft  displays. 
Identification  and  search  performance  were  their  tasks  of 
interest. 

Christ  cautions  the  user  on  the  interpretation  of  the 
magnitude  of  the  given  ranges.  These  gain  and  loss 
figures  were  calculated  using  data  reported  in  the 
literature.  Very  few  studies  mentioned  or  controlled  for 
the  di scrlminabi 1 Ity  of  the  code.  Christ  suggests  this 
factor  may  account  for  the  large  scores  (+  or  -)  and/or 
large  spread  of  scores.  For  example,  if  the  achromatic 
code  was  more  di scr 1ml nabl e  than  the  comparative  color 
code,  the  loss  score  may  be  inflated.  However  in  another 
study  in  the  same  category,  if  the  color  code  was  made 
more  di scr imi nabl e,  the  gain  score  may  be  Inflated.  The 
overall  result  would  then  be  a  large  range  of  per  cent 
difference  scores.  Few  studies  have  addressed  this 
dlscr iminabl 1 ity  of  the  code  or  other  factors  such  as 
ambient  lighting,  luminance  conditions  or  target 
background,  all  of  which  may  have  a  significant  affect  on 
the  magnitude  of  the  scores  calculated.  Christ  also 
pointed  out  over  half  of  the  ranges  were  derived  from  two 


or  fewer  studies.  Also  all  studies  incorporated  in  this 

table  used  inexperienced  subjects.  However,  when  keeping 

these  points  in  mind,  ‘the  user  can  consider  this  table  as 

a  concise  summary  of  what  the  literature  has  reported  when 

examining  color  coding  in  visual  displays. 

Christ's  secondary  objective  in  searching  the 

literature  was  to  Identify  gaps  in  knowledge  which  would 

prevent  a  sound  decision  on  whether  to  integrate  color  in 

aircraft  displays.  He  ascertained  that 

the  data  presented  in  this  report  point  to  many 
general  and  absolute  gaps  in  knowledge.  There 
is  simply  no  data  available  for  making  some 
color  code-achromatic  code  comparisons. 
Furthermore,  even  when  several  comparative 

values  are  available  in  the  literature,  they 
often  have  been  obtained  under  relatively 

restricted  conditions.  Thus,  there  is  a  general 
need  for  more  parametric  research  designed  to 
provide  data  that  would  allow  a  broader  and  a 
more  representat J ve  comparison  between  color 
codes  and  various  achromatic  codes.  (Christ, 

1975,  p.  563) 

Christ  lists  seven  specific  areas  that  the  research  to 
date  has  fa  1 1 en  short . 

1.  The  bulk  of  the  data  available  is  derived  from 
subjects  inexperienced  with  the  given  task.  This  can 
restrict  generalizations  of  conclusions. 

2.  Without  exception  the  subjects  were  in  a 
laboratory  environment  where  they  directed  their  full 
attention  to  the  single  simple  task.  Real  world  displays 
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usually  have  to  compete  for  the  operator's  time  with  the 
surrounding  environment. 

3.  The  data  to  date  have  confounded  the  display 
density  and  exposure  time.  Both  of  these  variables  are  of 
important  consideration  for  display  design. 

4.  Little  data  are  available  on  how  the  memory  for 
colors  deteriorates  during  a  retention  interval. 

5.  The  issue  of  when  a  subject  can  most  effectively 
use  a  selection  criterion  based  on  color  of  a  briefly 
exposed  display  has  not  been  fully  explored. 

6.  Some  data  have  suggested  color  can  interfere  with 
the  accuracy  of  identifying  and  the  speed  of  locating 
achromatic  target  attributes.  Hardly  any  data  are 
available  to  indicate  why. 

7.  Though  redundancy  of  color  has  not  generally  been 
found  to  aid  in  Identification,  does  this  hold  true  for 
practiced  operators  and/or  for  targets  whose  other 
features  are  degraded? 

Based  on  these  delineations  of  the  voids  in 
experimental  research,  Christ  and  his  associates  designed 


and 

performed 

a 

ser 1 es 

of 

experiments  in 

attempts  to 

narrow  the  gaps. 

The 

impetus  behind  their 

research  was 

the 

question 

of 

whether 

or 

not  to  Integrate  color  into 

aircraft  displays. 


Christ  and  Corso  included  experience  of  subject  in 


the  experimental  design.  The  eight  highly  practiced  (over 
one  year  of  training)  paid  male  subjects  performed  three 
simple  discrete  tasks  in  isolation.  The  tasks  included 
choice  reaction  time  which  required  no  memory,  search  and 
locate,  and  information-memory  task.  Codes  (letters, 
digits,  shapes  and  color  dots)  and  density  (low  and  high) 
were  the  Independent  variables  with  reaction  time  and 
accuracy  as  the  dependent  variables.  The  colors  used  were 
purple,  blue,  green,  yellow,  orange  and  red. 

After  analyzing  the  data  col lected  from  384 
trials/subject/condition  Christ  and  Corso's  findings  were 
concurrent  with  earlier  results  using  inexperienced 
subjects.  For  the  choice  reaction  (Identify)  task,  digits 
provided  for  the  best  performance  while  color  was 
consistently  associated  with  poorest  performance.  In  the 
search  and  locate  task,  color  was  found  to  be  relatively 
advantageous  for  target  codes  In  high  density  displays. 
Digits,  shapes  and  color  did  not  differ  significantly  from 
each  other  as  the  best  of  the  four  codes  to  use  1 n  an 
informat  ion -memory  task  that  required  limited  viewing  time 
and  short  term  memory.  Their  overall  conclusion  was  that 
experience  did  not  change  the  effects  of  using  color  as  a 
code  in  simple  isolated  tasks  (Christ  and  Corso,  1983; 


o»V 


Christ,  Teichner  and  Cor so,  1977). 

Teichner,  Christ  and  Corso,  1977,  felt  more  realistic 
laboratory  representations  of  an  operational  setting 
needed  to  be  included  in  the  research.  Their  problem 
statement  was,  within  a  more  complex  operation  will  color 
coding  reveal  effects  not  seen  in  an  isolated  simple  task 
setting.  Using  the  same  three  tasks  and  the  same  highly 
practiced  subjects  as  described  above  in  Christ  and  Corso, 
1975,  they  designed  three  experiments  which  combined  and 
integrated  the  tasks  to  achieve  a  more  realistic  setting. 
The  first  experiment  combined  all  three  tasks  whose 
displays  were  all  on  a  pane!  in  front  of  the  subject.  A 
task  was  initiated  at  random  for  the  subject  to  complete. 
The  overall  results  indicated  there  were  no  significant 
differences  between  the  codes  across  all  the  tasks  except 
one.  As  found  In  their  earlier  study,  color  was  found  to 
be  the  best  code  for  targets  in  a  high  density  display. 
The  order  for  codes  from  slowest  to  fastest  was  letter, 
digit,  shape  and  color  dots,  with  the  only  statistically 
significant  difference  being  between  letter  and  color. 
Again  the  use  of  color  appeared  to  be  task  dependent. 

The  second  experiment  was  the  same  as  the  first  with 
the  addition  of  a  same-difference  comparison  task.  The 
same-difference  task  Involved  having  the  subject  determine 
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whether  the  display  content  remained  the  same  as  the 
previous  display  or  had  changed  (different).  New  subjects 
were  extensively  trained  on  the  Isolated  tasks.  Analysis 
of  that  data  showed  for  the  original  three  tasks,  color 
was  associated  with  the  slowest  response  time  In  the  low 
density  Identification-memory  task.  Otherwise  the  codes 
were  comparable  to  one  another.  In  the  same-difference 
comparison  task  colors  were  responded  to  with  less 
accuracy  and  speed  than  any  of  the  other  achromatic  codes. 
Christ,  Teichner  and  Corso  attribute  the  inconsistencies 
between  these  findings  and  those  from  the  first  experiment 
to  the  differences  In  practice  of  the  subjects. 

A  third  experiment  attempted  to  put  the  subject  in  a 
more  realistic  environment.  He  assumed  the  role  of  an  air 
traffic  controller  (ATC)  who  had,  as  a  part  of  his  job, 
the  completion  of  the  three  original  discrete  tasks  while 
monitoring  and  controlling  aircraft  altitude,  speed  and 
heading.  The  eight  original  subjects  were  used.  The 
results  showed  that  with  practice  to  proficiency  on  the 
ATC  task,  the  use  of  color  did  not  affect  either  job. 
There  was  no  difference  In  performance  on  either  the  ATC 
or  the  simple  discrete  tasks  that  could  be  attributable  to 
color. 

The  overal 1  conclusion  from  these  experiments  as  well 


as  the  earlier  ones  done  by  Christ  and  Corso  was  that 
color  as  a  code  "affects  performance  in  a  manner  no 

different  from  any  of  the  achromatic  codes  used" 

(Telchner,  Christ  and  Corso,  1977,  p.  33).  If  there  were 
differences,  practice  appeared  to  attenuate  those 
differences.  Color  does  not  have  a  unique  quality  for 
coding  in  a  display  over  other  codes  in  a  search  and 

locate  type  task.  It  was  sometimes  associated  with  the 
best  performance  while  other  times  with  the  worst  when 
using  response  time  and  accuracy  as  the  measures  of 

performance  (Christ,  1983;  Christ,  Telchner  and  Corso, 
1977)  . 

Conclusions  from  the  Literature 
Almost  all  studies  on  the  use  of  color  in  visual 
displays  have  been  conducted  using  search,  count,  locate 
and/or  identify  type  tasks.  Memory  was  included  in  only 
three:  Dooley  and  Harkins,  1970;  Christ,  1975;  and 

Telchner  et  al.,  1977.  Most  were  done  In  an  Isolated 
laboratory  setting  In  which  the  tasks  were  simple  and 
discrete.  Except  for  Telchner  et  al.,  all  generalizations 
on  the  use  of  color  have  been  based  on  subjects  lacking  in 
experience  in  the  task  required.  Most  studies  have  used 


letters,  digits  and  geometric  shapes  as  the  achromatic 
codes  or  brightness  with  which  color  was  compared  for 


effectiveness.  Without  exception  the  measure  of 
performance  of  operator  was  speed  and/or  accuracy. 

Given  these  conditions,  the  overall  finding  was  that 
the  relative  effectiveness  of  color  as  a  visual  display 
code  depended  upon  other  display  conditions,  the  task  and 
the  measure  of  performance.  Generally  when  performing  the 
simple  discrete  task  of  search,  locate  or  identify  in  an 


Isolated  setting. 

digit 

coding  was 

superior 

for 

the 

ident If i cat  Ion  task 

while 

color  was 

superior 

for 

the 

search  or  locate  task.  When  these  tasks  were  performed  in 
combination,  color  appeared  equal  or  superior  as  a  code  in 
reaction  and  location  time  and  for  recall,  but  inferior  in 
accuracy  and  speed  In  same-difference  comparison  tasks. 
When  code  was  employed  in  a  combined  task  which  was  a 

subset  of  the  overall  Job  and  the  operators  were 

proficient  at  all  tasks,  color  ranked  equally  with  the 

other  three  achromatic  codes.  These  conclusions  are  in 
conflict  with  the  subjective  reasoning  that  color  is 

obviously  better.  One  might  Justify  the  nonconcurrence  of 
the  theoretical  and  experimental  findings  in  how  color  is 
presently  used  as  an  Information  code.  It  may  not  be 
taking  advantage  of  the  dual  processing  capabilities  of 


There  continues  to  remain  glaring  gaps  In  the 

literature  on  the  knowledge  of  the  use  of  color  In  visual 
displays.  The  most  prominent  area  was  the  tasks  on  which 
the  findings  were  generalized:  search,  locate  and 

Identify.  These  are  only  a  small  subset  of  uses  of  visual 
displays  especially  with  the  pro 1  I f erat 1 on  of  the 

affordable  computers.  Yet  the  1 Iterature  continues  to 
dwell  only  in  this  narrow  field.  With  computer  graphics 
penetrating  almost  every  sector  of  the  technological 
world,  visual  displays  are  being  used  everywhere  for  many 
different  purposes  that  Involve  Information  processing. 
One  of  the  fastest  growing  sectors  of  users  are  managers 
and  decision  makers  who  are  expanding  their  DSS  (Berk  et 
al.,  1982;  McLamb,  1982).  These  consumers  have  the  option 
of  having  their  OSS  support  color  or  achromatic  coding  of 
their  information  whether  It  be  graphics  or  text. 

Presently,  to  a  limited  degree,  they  also  have  the  choice 
of  a  CRT  or  hardcopy  display  device  which  supports  the 
same  options.  Will  color  on  their  visual  displays  aid 
managers  In  their  primary  function  of  making  decisions  at 
different  levels?  The  subjective  literature  purports  It 
will.  They  advocate  color  will  speed  up  the  decision 
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making  process  significantly  as  well  as  allow  for  better 
management  recall  or  memory.  For  most  decision  makers 
time  and  accuracy  are  two  of  their  most  coveted  resources 
(Kelley,  1979;  McLamb,  1982;  Mlastkowskl  and  Wrege,  1981; 
Takeuchl  and  Schmidt,  1980;  Watson  and  Driver,  1983;  Zmud, 
1979) . 

The  experimental  literature  has  not  rigorously 
addressed  the  effects  of  the  use  of  color  displays  for  the 
decision  making  process.  However  If  existing  results 
could  be  generalized,  they  are  In  conflict  with  the 
popular  subjective  opinion.  The  experimental  literature 
concludes  the  effects  of  color  in  visual  displays  are  a 
function  of  the  task  with  the  overall  conclusion  that 
there  Is  not  evidence  to  support  color  Is  a  better  code 
than  other  achromatic  codes.  The  question  remains 
systematically  unexam l ned,  thus  unanswered,  on  whether 
color  visual  displays  will  aid  decision  makers  in  their 
information  processing  more  than  achromatic  visual 
displays. 

Researchers  in  the  discipline  of  Information  Systems 
(IS)  agree  with  those  In  Human  Factors  that  research  is 
very  scarce  In  the  area  of  the  use  of  visual  displays  In 
the  decision  making  process  (Benbasat  and  Taylor,  1978; 
Dickson  et  al.,  1977;  Hanson,  1979;  Ives,  Hamilton  and 


Davis,  1980;  Mason  and  Mltroff,  1973;  Watson  and  Driver, 
1983;  Zmud,  1979).  They  feel  that  the  DSS  have  been 
proliferating  with  high  praise  for  the  technology 
advancements  themselves.  However  It  appears  the  growth 
has  been  In  an  ad  hoc  manner  with  efforts  falling  quite 
short  of  realizing  the  maximum  effectiveness  and 
efficiency  of  the  system  design.  Mason  and  Mltroff,  noted 
experts  In  the  discipline  of  IS  research,  delineated 
guidelines  for  this  much  needed  research.  Since  a  DSS  is 
composed  of  a  person  of  a  certain  psychological  type,  a 
class  of  problems,  an  organizational  context,  a  mode  of 
display  and  data  of  a  certain  form,  these  are  the  key 
variables.  In  order  to  design  effective  displays,  these 
variables  must  be  systematically  examined  In  all  possible 
combinations  for  main  and  interaction  effects.  The 
majority  of  research  to  date  has  centered  around  the  type 
of  decision  maker  who  uses  objective  hard  facts  from  a 
computer  pr 1 ntout  1 n  a  proven  formu 1  a  or  mode  1  to  so  1 ve  a 
deterministic  class  of  problems  of  a  nonplanning  nature 
(Dickson  et  al.,  1977;  Mason  and  Mltroff,  1973;  Lucas, 
1981).  Not  all  DSS  are  described  by  these  specific 
characteristics  and  it  Is  naive  to  assume  that  DSS 
requirements  do  not  vary  as  a  function  of  the  above 


variables 


There  fs  a  growing  interest  in  the  mode  of  visual 
display  of  the  DSS  as  it  is  presumed  to  have  a  major 
impact  on  if  and  how  the  decision  maker  uses  the  DSS 
(Dickson  et  a!.,  1977;  Lucas,  1981).  With  the  continuing 
technological  advancements,  decision  makers  have  choices. 
They  can  use  a  CRT  or  hardcopy  man/machine  Interface. 
Their  data  can  be  presented  on  these  interfaces  in  text, 
graphics  or  mixed  format.  This  information  can  be  coded 
in  color  or  achromatic  pattern. 

Color  Graphic  Displays  in  DSS 

There  is  a  paucity  of  research  in  the  area  of 
computer  graphics  in  IS  and  there  is  a  current  need  for 
it  especially  in  a  laboratory  environment  where  variables 
can  be  controlled  and  sufficient  number  of  subjects  tested 
(DeMars,  1975;  Dickson  et  a!.,  1977;  Haber  and  Wilkinson, 
1982;  Lucas,  1981;  Tull  is,  1981;  Wallace,  1981;  Waller,  et 
al.,  1982;  Watson  and  Driver,  1983).  One  reason  for  this 
is  that  prior  to  1973  the  effects  on  the  human  operator 
were  neglected  in  IS  research  (Benbasat  and  Taylor,  1978; 
Parsons,  1970).  The  Minnesota  Experiments  were  a  start. 
These  experiments  addressed  the  human/machine  interface 


and  format  options  but  did  not  include  color.  Follow-up 
and  succeeding  studies  have  examined  these  variables  but 
only  two  considered  color.  These  were  Dooley  and  Harkins 


65 


(  1970)  and  Tull  Is  (1981)  discussed  earlier.  Dooley  and 
Harkins  used  colored  graphics  but  not  as  the  recognized 
source  of  the  subjects'  task  at  hand.  Tunis  (1981)  used 
colored  graphics  In  a  training  and  diagnostic  task  of  a 
worker  which  did  not  Involve  memory.  A  CRT  was  the  mode 
of  visual  display.  Color  was  compared  to  achromatic  shape 
coding  In  the  graphical  displays. 

There  Is  no  doubt  that  color  Is  a  very  attractive 
feature  and  Is  used  extensively  in  DSS.  However  there 
still  remains  much  to  be  learned  regarding  when  and  how  to 
use  It  to  Increase  a  decision  maker's  performance. 
Quantitative  Information  about  the  relative  effectiveness 
on  decision  maker's  output  when  using  color  versus 
achromatic  codes  In  graphics  formats  Is  needed  so  that  the 
benefits  color  adds  to  a  DSS  can  be  understood  and 
Incorporated  for  Its  more  efficient  utilization  (Bork, 
1980;  Chard,  1979;  Christ,  1977;  Christ  and  Corso,  1983; 
Denbrook,  1982;  Durrett  et  al.,  1982;  Hall  worth  and 
Brebner,  1980;  Hanson,  1979;  Loceff  and  Loceff,  1981; 
Taylor,  1979;  Truckenbroad ,  1981;  Tull  Is,  1981;  Waller  et 
al.,  1982). 


III.  METHODOLOGY 


I ntroduct i on 


Research 

into 

the 

understanding 

of  an 

DSS 

must 

cons  1 der  a  1 1 

parts 

of 

the  system. 

However , 

not 

al  1 

combinations 

of 

al  1 

parts  of  the 

system 

need 

be 

Investigated  simultaneously  or  in  one  study.  A  referee  to 
Lucas  (1981)  commented  on  the  general  findings  of  the 
1 Iterature: 

I  am  afraid  that  greater  understanding  In 
this  area  (DSS)  will  be  accomplished  through 
many  small  studies  rather  than  by  some  major 
breakthrough  (p.  758). 

With  this  in  mind  a  study  was  designed  whose 
objective  was  to  gain  insight  on  how  the  attribute  of 
color,  when  used  functionally  in  computer  generated  mixed 
graphics  displays,  affects  the  performance  of  a  decision 
maker  who  has  an  engineering  background.  In  this  chapter 
the  research  method  is  identified  first.  Using  Mason  and 
Mltroff's  definition,  the  DSS's  key  parts  are  then  defined 
with  those  that  are  considered  variables  for  this  study 
expounded  upon  in  detail.  Pretesting  and  Its  results  are 
then  described.  The  chapter  concludes  with  the  procedures 
used  in  carrying  out  the  study. 


organizational  setting  (Mason  and  Mitroff,  1973).  The  DSS 
of  interest  In  this  research  involved  decision  makers  who 


have  an  academic  engineering  background.  The  problem  was 
classified  as  structured  with  decisions  made  under 
certainty  as  the  data  were  deterministic  in  nature.  The 
data  were  presented  in  a  mixed  graphics  form  either  in  one 
of  four  color  alternatives  or  monochrome.  The  man-machine 
interface  was  computer  generated  hardcopy.  The 
organizational  context  was  that  of  management  control  of 
resources. 

Pqpu 1  at i on .  Personal  and  background  differences  have 

been  found  to  influence  the  way  in  which  decision  makers 
process  information  and  solve  problems  (Dickson  et  al, 
1977,  Lucas,  1980,  Mason  and  Mitroff,  1973,  Zmud,  1979). 
The  population  of  Interest  is  decision  makers  who  have  an 
academic  engineering  background.  It  is  assumed  those 
students  enrolled  In  the  Engineering  College,  ASU  and 
taking  summer  coursework  as  a  group  do  not  differ 

significantly  in  their  decision  processes  from  those  in 
the  engineering  workforce  making  the  managerial  type 
decisions  based  on  deterministic  facts.  In  fact,  many  of 
the  students  included  in  the  sample  have  been  or  were 
presently  a  member  of  the  engineering  workforce. 


The  sample  population  came  from  those  enrolled  in 


engineering  courses  taught  during  the  Summer  1983  academic 
sessions.  The  experimenter  visited  six  courses  (two 
undergraduate  and  one  graduate  class  in  probability  and 
statistics  as  well  as  two  undergraduate  engineering 

graphics  classes  and  one  graduate  decision  analysis  class) 
soliciting  volunteers.  The  experiment  was  briefly 

explained  and  a  sign-up  sheet  passed  among  the  students. 
All  but  one  class  instructor  offered  the  incentive  of 
homework  points  for  participation.  The  majority  of  each 
class  volunteered.  Over  99%  of  those  who  signed  up  for 
the  experiment  came  on  the  scheduled  day  and  120  of  them 
completed  the  experiment. 

The  declared  academic  majors  of  those  who  volunteered 
included  aeronautical,  biological,  chemical,  civil, 

industrial  and  mechanical  engineering  as  well  as  physics, 
mathematics  and  computer  science.  The  characteristics  of 

interest  of  the  sample  used  in  this  study  are  given  in 

detail  In  Table  3.1,  Characteristics  of  Engineering 

Student  Sample  along  with  how  many  of  the  sample  reported 
those  characteristics.  These  data  were  extracted  from  the 
Background  and  Experience  quest ionnaire  (Appendix  A)  each 
subject  filled  out  prior  to  participating  In  the  actual 
experiment.  The  breakdown  into  levels  of  each 
characteristic  was  based  on  either  a  natural  split  or 


Table  3.1 

CHARACTERISTICS 

of 

ENGINEERING  STUDENT  SAMPLE 


Characteristic 

Samp  1 e 

Age  (years) 

17-22 

41 

23-28 

52 

29+ 

27 

Sex 

Female 

33 

Male 

87 

College  Experience  (years) 

<3 

37 

>3  and  <5 

30 

25 

53 

Education  Level 

<2  years 

25 

2.2  yrs  and  not  Graduate  status 

66 

Graduate 

29 

CRT  Experience 

Not  a  User  (Never  Use) 

33 

Moderate  User  (0-6  hrs/wk) 

44 

Frequent  User  (>6  hrs/wk) 

43 

Attitude  toward  Computer  Use 

Uncomf ortab 1 e 

31 

Comf ortab 1 e 

48 

Very  Comfortable 

41 

Video  Game  Experience 

Do  Not  or  Seldom  Play  (0-3  hrs/wk) 
Moderate  Player  03  hrs/wk) 


39 

81 


requirements  of  the  analysis  techniques  used.  If  a 
natural  split  was  not  obvious,  the  levels  were  decided 
based  on  a  likely  natural  split  as  well  as  the  division  of 
the  sample  character  1 st 1 cs  when  tabulating  them  in 

accordance  with  the  five  treatment  levels  of  color.  Two 
criteria  were  followed.  The  levels  defined  were  likely. 
Second,  there  had  to  be  at  minimum  two 
subjects/treatment/characteristic  level.  This  was  data 
dependent  because  the  only  independent  variable  under 
experimental  control  was  the  treatment  levels.  The  other 
independent  variables  were  based  on  the  sample,  who,  under 
the  assumption,  share  the  same  characteristics  as  a  group 
as  the  engineering  workforce.  The  levels  of  each 
characteristic  factor  are  defined  below. 

Age  was  divided  Into  three  levels.  A  six  year  span 
was  represented  in  each  of  the  first  two  levels  17  to  22 
years  and  23  to  28  years.  The  oldest  group,  29+  years. 
Included  the  remaining  of  the  sample,  making  the 
categories  mutually  exclusive  and  exhaustive. 

Sex  levels  were  based  on  a  natural  split  of  female 
and  male.  The  male/female  unbalance  was  expected  as  It 
reflected  the  proportion  of  each  sex  In  the  engineering 
student  population.  This  male/female  ratio  may  favor 


women  more  than  In  the  overall  engineering  workforce 


because  more  women  are  now  entering  the  engineering 
discipline  to  train  for  the  workforce. 

College  experience  represented  the  time  each  has  been 
exposed  to  a  college  level  academic  train  of  thought 
process  without  controlling  for  the  specific  education 
level.  Three  levels  were  defined:  less  than  3  years,  3 
years  but  less  than  5  years,  and  at  least  5  years. 

Education  level  defined  the  extent  of  educational 
experience  rather  than  years  in  school.  Those  who  have 
less  than  two  years  of  college  have  generally  taken  only 
Introductory  courses.  It  is  assumed  those  who  have  had  at 
least  two  years  and  were  undergraduates  have  been  exposed 
to  a  higher  level  of  academic  thought  process.  Graduate 
students.  It  is  assumed,  are  taught  at  a  different  level 
that  is  more  demanding  and/or  creative  than 
undergraduates . 

CRT  experience  followed  a  natural  breakdown.  There 
were  those  who  do  not  use  it  (Never  Use),  interface  with 
it  on  a  moderate  basis  (0  to  6  hours/week),  or  frequently 
use  It  (more  than  6  hours/week). 

Attitude  towards  Interfacing  with  the  computer  also 
followed  a  natural  breakout.  Subjects  reported  they  were 
either  very  uncomfortabl e  or  uncomfortabl e 
(Uncomfortable),  hesitant  or  comfortable  ( Comf ortab 1 e ) ,  or 


very  comfortable  (very  Comfortable). 

The  last  variable  of  Interest  was  video  game 

experience.  Many  video  games  require  decision  making  of 
some  type  and  use  the  CRT  as  the  human-machine  interface. 
Most  commercial  as  well  as  the  majority  of  home  video 
games  are  played  in  color.  To  investigate  if  video  game 
experience  affected  the  performance  across  the  different 
color  treatments,  decision  makers  were  categorized  In  one 
of  two  levels.  Either  a  subject  did  not  or  seldom  played 
video  games  (0  to  3  hours/week)  or  was  a  moderate  player 
03  hours/week).  The  frequent  player  category  was  not 

included  as  only  two  of  the  sample  reported  playing  more 
that  6  hours/week.  Originally  It  was  intended  to  break 
out  users  as  to  type  played:  color  or  monochrome. 

However,  only  two  of  the  users  reported  playing  in 

monochrome  which  was  not  a  sufficient  sample  size  for  this 
analyst  s. 

The  one  characteristic  that  was  considered  but  not 
detailed  In  the  table  Is  color  blindness.  Since  one  of 
the  key  variables  of  the  study  is  col  or /monochrome  and  the 
treatments  were  assigned  at  random,  the  experimenter 
excluded  anyone  who  had  a  potential  of  being  color  blind. 
To  test  this  potential  the  Dvorine  Pseudo- I sochromatic 
Test  was  administered  to  each  volunteer.  Linksz  (1964,  p. 
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238)  feels  "as  a  screening  device  the  Dvorine  Test  is  by 


far  the 

superior  one 

(over 

the 

most 

popular  Ishlhara 

Test) . " 

The  critical 

score 

was 

3  or 

more  missed  which 

Indicates  a  potential  color  vision  problem. 

Instrument.  To  satisfy  the  objective  of  the  study  a 
problem  type  was  chosen  that  represented  a  general  class 
of  problem  and  thought  process  typical  of  a  decision  maker 
with  an  engineering  background.  Based  on  these  criteria* 
the  experience  of  the  literature,  the  contents  of  several 
engineering  textbooks  and  the  level  of  the  sampled 
population,  an  Inventory  control  case  study  was  selected 
as  the  content  of  the  decision  process. 

The  logistics  management  game  has  been  the  tool  used 
over  the  years  by  MIS  researchers  to  measure  the  impact  of 
an  Information  system  on  user  performance  (Dickson  et  al., 
1977;  Lucas,  1980).  The  game,  played  at  Stanford 
University  and  other  research  institutions  for  years. 
Involves  the  subject  assuming  the  role  of  a  decision  maker 
In  the  management  of  product  logistics.  The  decision 
maker,  who  Is  given  such  facts  as  beginning  inventory, 
demand  level,  costs,  revenues,  etc.,  is  asked  to  maximize 
profits.  A  form  of  this  game  was  used  in  the  Minnesota 
Experiments.  Dickson  et  al.  (1977)  also  used  a 
modification  of  this  game  as  the  research  vehicle  in  their 
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I 

nine  experiments  aimed  at  gaining  insight  into  the  effect 
of  cognitive  style  and  interface  output.  Lucas  (1980)  too 
|  employed  a  form  of  this  logistics  management  game  to 

simulate  a  OSS  when  he  investigated  the  Impact  of  the  mode 
of  Information  presentation  (hardcopy,  CRT,  tabular  or 
i  graphics  form).  In  his  conclusions  he  suggested  the  game 

was  too  complex  and  recommended  It  be  simplified  when  used 
as  an  experimental  tool  In  further  DSS  research. 

!  The  Inventory  control  problem  in  Its  simplest  form 

can  be  found  In  many  basic  engineering  textbooks. 

Including  those  on  production  control,  operations 
research,  decision  analysis  among  others.  Because  of  this 
and  Its  historic  use  In  MIS  research,  this  problem  type 
was  a  prime  candidate  for  this  Investigation.  An  example 
^  problem  was  found  In  HI  l  Her  and  Lleberman  (1974)  under 

Inventory  theory  that,  when  revised,  fit  the  objectives  of 
this  research,  was  similar  In  nature  to  the  logistic  game 
that  had  been  validated  In  the  literature  for  DSS 
research,  and  was  a  basic  engineering  problem. 

The  selected  Inventory  case  study  was  deterministic 
Instead  of  probabilistic  like  those  used  In  literature. 
Uncertainty  would  have  made  the  problem  more  complex  as 
well  as  limited  the  population  of  users  to  those  who  were 
familiar  with  probability  and  statistical  theory.  Lucas 
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(1980)  recommended  the  problem  be  simplified.  Also, 
deterministic  problems  have  exact  answers  which  eliminated 
any  bias  of  the  experimenter's  own  decision  making  style 
when  scoring  the  results  of  the  case  study.  Another 
reason  for  making  the  problem  deterministic  was  the 
comment : 

One  of  the  most  widely  observed  phenomena  of 
management  dec i s i on  mak i ng  is  the  genera  1 
suppression  of  uncertainty.  In  the  vast 
majority  of  instances,  it  would  appear  that 
uncertainty  is  not  dealt  with  in  any  explicit 
fashion  (Morris,  1977,  p.  30). 

Therefore  by  making  the  Inventory  problem  deterministic  it 

became  simpler  and  workable  by  a  larger  population,  yet 

without  a  great  loss  of  reality  since  most  problems  are 

handled  deterministically  anyway. 


The 

case  study 

1  s 

presented 

i  n 

its  entirety  in 

Appendix 

B.  Though 

the 

problem 

type 

i s  common  i n  many 

engineering  texts,  formal  academic  training  on  inventory 
theory  was  not  a  prerequisite  for  volunteers.  The 
mathematical  skills  and  scientific  approach  underlying 
most  basic  engineering  coursework  were  all  the  tools 
necessary  to  work  the  case  study.  Calculators,  rulers  and 
complex  formulas  were  not  needed  or  used  in  the 
experiment.  The  subjects  brought  only  a  pen  or  pencil. 

The  case  study  was  divided  into  four  scenarios,  each 
building  on  the  original  facts.  The  arrangement  and  type 


of  decisions  to  be  made  (questions)  as  well  as  the  timing 
are  discussed  under  the  measure  of  performance  section. 
An  explanation  of  the  particular  format  of  the  case 
study's  data  is  given  in  the  next  section. 

Mode  of  Presentation.  The  format  or  mode  of 
presentation  in  which  data  are  given  to  the  decision  maker 
has  elicited  considerable  attention  of  researchers  as  it 
Is  a  key  variable  In  the  DSS's  definition.  Previous 
studies  have  Investigated  aspects  of  modes  such  as 
narrative  versus  tabular  versus  graphical,  line  versus  bar 
versus  pie  graphs,  among  others.  This  study's  basic  mode 
is  computer  generated  mixed  graphics.  IBM  defines  mixed 
graphics  as  information  that  is  displayed  in  text  and 
picture  format  (Benson,  1982).  This  particular  form  was 
chosen  for  several  reasons.  Primarily  the  objective  of 
the  study  was  to  Investigate  the  impact  of  the  attribute 
of  color  on  the  performance  of  the  decision  maker. 
Initially,  after  researching  the  software  and  hardware 
available  that  supported  color.  It  was  discovered  that 
most  restricted  the  use  of  color  to  the  graphical  part  of 
the  display.  Therefore,  those  decision  makers  using 
computer  generated  information  that  had  the  attribute  of 
color  were  most  likely  using  graphical  output.  The 
assumption  is  made  that  even  with  the  emerging  available 


and  affordable  technology,  the  graphics  will  remain  the 
predominant  form  of  output  possessing  the  attribute  of 
color. 

Another  reason  mixed  graphics  output  was  selected  as 
the  format  was  because  it  is  the  most  commonly  used  format 
with  the  class  of  problem  under  study.  This  is  supported 
by  the  literature  (Dickson  et  al.,  1977;  Lucas,  1980)  as 
well  as  by  the  software  available  that  is  written  for  this 
class  of  problem  (Benson,  1982;  Kirby,  1979;  McLamb, 
1982).  The  most  popular  of  the  graphs  are  bar/column, 
line  and  pie  charts.  This  study  used  the  column 
(vertical)  charts  because  they  satisfied  the  objectives  of 
the  study,  fit  the  data  presented,  are  commonly  used  and 
most  people  understand  them. 

The  text  part  of  the  display  complemented  the  chart's 
information.  Its  purpose  was  to  define  the  legend  of  the 
column  charts  as  well  as  give  detailed  information  not 
readily  available  at  a  glance  on  the  charts.  Each  graphic 
display  was  designed  to  stand  alone  without  further 
documentat i on . 

The  graphics  standards  as  specified  by  Francis 
(1962),  Mclamb  (1982)  and  Schmidt  and  Schmidt  (1979)  were 
followed  when  designing  the  mixed  graphic  displays.  The 
guidelines  concerning  the  text,  labels,  scale,  baseline. 


grfdlines,  graph  shape,  axis,  origin,  dimension,  spacing 
and  grouping  were  incorporated  into  the  layout.  The 
patterns  and  alphanumeric  text  were  computer  software 
generated  by  the  DI  (Device  Independent)  3000  language, 
thus  satisfying  engineering  graphics  standards. 

There  were  four  consecutive  scenarios  to  the  case 
study.  The  first  three  defined  the  facts  of  the  entire 
case  study,  with  each  succeeding  scenario  building  on  the 
previous  one.  The  first  three  scenarios  were  essentially 
alike.  They  simulated  a  scenario  of  a  decision  maker  who 
had  to  order  the  Inventory  for  the  summer  (Scenario  1). 
Then  the  decision  maker  was  informed  the  fall  order  had  to 
be  placed  to  the  suppliers  with  the  summer  order  (Scenario 
2)  and  discovered  what  was  ordered  in  the  summer  had  a 
significant  Impact  on  what  was  ordered  for  fall  if 
maximizing  total  profits  was  the  objective  (Scenario  3). 
In  each  part,  necessary  background  to  the  scenario  was 
presented  in  text  form.  This  information  was  translated 
into  the  mixed  graphics  displays  with  one  display  per 
Inventory  item.  These  graphics  were  followed  by  the 
decisions  that  needed  to  be  made  which  were  in  multiple 
choice  question  format.  Because  the  problem  was 
deterministic  in  nature  there  could  be  and  was  a  best 
objective  answer  for  each  question  that  maximized  profit 


despite  the  decision  maker's  style.  In  the  first  scenario 
all  questions  were  answerable  using  the  mixed  graphics 
displays  In  that  part.  For  the  second  scenario's 
decisions*  for  which  the  first  scenario's  displays  were  no 
longer  available  for  review,  both  the  first  and/or  second 
mixed  graphic  information  were  needed.  The  third  section 
of  the  case  study  had  the  mixed  graphic  displays  that 
reflected  the  changes  in  the  scenario,  but  whose  questions 
were  based  on  al 1  three  parts.  The  fourth  scenario  (post 
test)  section  of  the  case  study  did  not  contain  any  new 
Information  or  mixed  graphic  displays.  It  consisted  of 
decisions  (multiple  choice  questions)  that  simulated  a 
conversation  with  the  next  level  of  management  on  the 
conclusions  the  decision  maker  made  during  the  analysis 
portion  of  the  case  study. 

The  primary  objective  of  this  research  was  to 

investigate  the  impact  of  color  when  used  functionally  on 
the  performance  of  the  decision  maker.  Color  is  used 

functionally  if  It  is  the  primary  information  code  (Dooley 
and  Harkins,  1970).  The  display,  shown  in  Figure  3.1, 
Mixed  Graphic  Display,  consisted  of  the  legend  of  graphics 
and  text  and  the  column  charts  of  breakdown  (left  chart) 
and  total  inventory  (right  chart)  dollar  figures.  There 
was  one  display  per  Inventory  item/scenario.  The  color  or 
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Figure  3.1  Mixed  Graphics  Display 
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monochrome  code  in  the  legend  allowed  the  decision  maker 
to  discriminate  between  the  information  available  in  the 
breakdown  ( 1  eft )  chart . 

The  literature  recognizes  that  there  Is  a  void  of 
research  in  the  area  of  determining  what  colors  should  be 
used.  Most  studies  chose  their  colors  arbitrarily  unless 
they  Involved  situations  where  specific  colors  were  used 
because  of  stereotyping.  Using  one  set  of  colors  and  a 
monochrome  configuration,  the  performance  of  each  was 
compared.  This  experiment  compared  several  sets  of  colors 
with  the  monochrome  patterns  in  an  attempt  to  determine  if 
it  was  the  presence  of  color  or  a  specific  set  of  colors 
that  had  an  affect,  If  any,  on  performance.  Four  sets  of 
color  combinations  and  one  of  monochrome  pattern  were 
selected  as  the  levels  of  treatment.  Three  criteria  were 
followed  in  determining  which  color  coded  what 
information.  First,  stereotyping  was  adhered  to  where 
appropriate.  Green,  If  used,  always  represented  a  revenue 
while  red  coded  a  cost.  Second,  two  colors  of  the  same 
hue  were  not  placed  adjacent  to  each  other  in  order  to 
avoid  confusion.  Finally,  the  remaining  colors  were 
placed  at  random.  Color  usage  was  compatible  across  all 
displays  as  a  particular  color  always  symbolized  the  same 
Information  (Keister,  1981). 


The  first  set  of  colors,  subsequently  coded  CRT  in 
the  tables  and  text,  were  the  primary  colors  preset  on  the 
majority  of  CRTs  that  support  color.  This  set  of  colors 
is  currently  the  most  often  used.  These  included  green, 
red,  yellow,  blue,  dark  blue,  cyan,  black  and  white.  The 
second  treatment,  subsequently  labeled  RND,  consisted  of  a 
random  sample  of  colors  that  are  available,  either  primary 
or  user  specified,  on  the  CRT  colorwheel.  The  saturation 
was  the  same  as  in  treatment  one  (CRT).  This  set  Included 
green,  hot  pink,  white,  turquoise,  purple,  orange,  yellow 
and  black.  The  third  set,  subsequently  noted  as  OUT  in 
the  tables  and  text,  were  consistent  with  the  first 
treatment:  green,  red,  blue,  yellow,  white,  black, 
turquoise  and  cyan.  However,  the  legend  squares  and 
column  bars  were  only  outlined  in  these  colors,  not 
solidly  filled.  The  fourth  treatment,  noted  as  PAS,  used 
a  random  selection  of  pastel  colors.  The  intensity  of  the 
color  was  significantly  less  that  that  in  the  CRT  or  RND 
treatment.  This  set  was  made  up  of  lime  green,  pink, 
yellow,  white,  purple,  grey,  blue  and  flesh.  The  exact 
identification  of  each  color  is  discussed  in  the  next 
section  when  explaining  the  Interface  used. 

The  fifth  treatment  was  monochrome  intensity  pattern. 
Computer  software  graphics  language  packages  often  offer 
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interior  intensity  patterns  that  can  be  used  when  using  a 
monochrome  graphics  display.  Eight  patterns  of  those 
available  in  the  D I -3000  graphics  language  used  to 
generate  the  displays  were  selected.  Non-similarity  in 
pattern  was  the  criterion  used  when  deciding  which  eight 
to  use.  The  patterns,  shown  in  Figure  3.1,  correspond  to 
22,  23,  24,  25,  26,  27,  29  and  36  as  shown  in  the  DI-3000 

Users  Manual  (1979),  p.  A-97.  Like  the  colors,  these 
patterns  filled  in  the  legend  squares  and  column  bars  and 
were  consistent  throughout  all  displays. 

Interface..  Decision  makers  utilize  various 
interfaces  when  interacting  with  their  particular  MIS. 
Hardcopy  has  been  the  most  prevalent  with  the  CRT  gaining 
due  to  the  availability  of  both  affordable  and  user 
friendly  hardware,  software  and  service.  Although  in  the 
near  future  the  CRT  will  probably  be  on  par  or  surpass 
hardcopy  as  the  primary  Interface,  it  is  assumed  the  CRT 
will  never  replace  hardcopy  altogether. 

The  original  intent  of  this  research  was  to  utilize 
the  CRT  as  the  man-machine  interface.  However  hardware 
constraints  were  encountered  which  prohibited  its  use. 
The  case  study  graphics  were  programmed  using  the  DI-3000 
graphics  language  software  package.  This  software  was 
selected  because  it  was  a  Fortran  based  graphics  language 
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that.  while  being  supported  on  the  Arizona  State 
University  (ASU)  Harris  800  computer  system,  was 
compatible  with  any  interface  graphics  termfnal  which  had 
access  to  the  Harris.  Oue  to  efficiency  of  the  hard  wired 
terminal,  the  program  (Appendix  C)  was  written,  debugged 
and  pretested  on  the  Genesco  and  Imlac  graphics  terminals. 
These  interfaces  support  only  monochrome,  handle  a  9600 
baud  rate  and  have  a  18  inch  screen.  The  program  was 
written  In  a  way  that  allowed  the  decision  maker  to  access 
any  graph  in  a  particular  scenario  as  many  times  as 
desired.  The  average  time  to  paint  the  graph  on  the 
screen  was  11  seconds  +/-  2  seconds,  depending  on  the 
number  of  users  accessing  the  Harris  system.  The  program 
was  self-explanatory  even  to  those  who  were  not  familiar 
or  comfortable  with  interfacing  with  the  computer.  The 
program  gave  screen  instructions,  displayed  the  graphical 
chart  requested  by  hitting  the  menu  number  on  the 
keyboard,  denied  access  to  any  graph  not  appropriate  to 
the  portion  of  the  case  study  being  worked,  counted  the 
total  number  of  graphs  viewed  and  clocked  the  viewing  time 
of  each  section.  It  was  written  so  the  decision  maker 
would  receive  preliminary  instructions  from  the 
experimenter  at  the  onset,  then  would  work  alone  with  the 
DSS  until  ready  for  the  post  test.  The  menu  of  graphics. 
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Instructions  and  graphics  were  displayed  on  the  CRT  with 
the  updated  scenario  and  decisions  (multiple  choice 
questions)  printed  on  hardcopy. 

During  the  process  of  writing  the  software  program, 
portions  of  It  were  tested  using  the  Tektronlcs  4027  as 
the  Interface.  The  Tektronlcs  terminal  was  the  Interface 
originally  Intended  for  use  In  the  experiment  because  It 
was  the  only  terminal  available  that  accessed  the  Harris 
and  supported  color  and  graphics.  During  the  time  the 
case  study  was  being  programmed,  the  Tektronlcs  4027  was 
In  a  location  that  mandated  access  to  the  Harris  via  a 
modam  which  allowed  a  300  baud  rate.  Portions  of  the 
program  were  tested  as  they  were  developed  on  the 
Tektronlcs  4027  and  were  found  to  work  the  same  as  on  the 
Genesco  and  Imlac  except  for  the  paint  time  of  the  charts. 
Because  of  the  baud  rate  (300  Instead  of  9600)  and  the 
raster  scan  process,  the  paint  time  of  the  chart  was  five 
minutes  as  opposed  to  11  seconds.  Arrangements  were  made 


to  relocate  the 

Tektronlcs  4027  In 

the 

ASU 

Comput 1 ng 

Services 

Computer 

Graphics  Site  where 

It 

could 

be 

hardwired 

to  the 

Harris,  thus  have 

the 

potential 

of 

receiving 

at  a  9600 

baud  rate. 

The  Tektronlcs  4027  terminal  was  hardwired  to  the 
Harris  at  the  time  of  pretesting.  The  program  was  brought 
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Into  memory.  After  the  Initial  Instructions  and  four 
charts  were  brought  to  the  screen,  the  Tektronics  terminal 
would  no  longer  support  the  case  study  program.  The 
Manager,  Graphics  Computing,  Mr.  J.  Howard,  analyzed  the 
problem  over  a  two  month  period.  He  Identified  a  problem 
in  the  D I -3000  software,  notified  the  authors  of  the 
language  and  suggested  a  fix.  It  was  implemented  in  the 
updated  version  of  01-3000  language  but  this  did  not  solve 
the  problem.  The  Tektronics  hardware  was  checked  and  its 
memory  upgraded.  Despite  these  efforts  over  a  three-month 
period,  the  Tektronics  4027  would  not  support  the  case 
study  program.  In  Mr.  Howard's  expert  opinion,  it  was  the 
hardware  of  the  Tektronics  4027  that  was  not  compatible  to 
the  relatively  new  graphics  language  despite  its  device 
independency.  The  program  executed  as  intended  on  both 
the  Genesco  and  Imlac  terminals,  but  neither  of  these 
graphic  Interfaces  supported  the  option  of  color.  Color 
terminals  with  drivers  written  that  would  allow  access  to 
the  Harris  mainframe  were  not  scheduled  on  line  until  late 
Fall  1983,  which  did  not  fit  the  schedule  of  this  study. 
Therefore,  the  other  primary  Interface  used  by  engineers 
in  many  of  their  decision  making  sessions,  either  one  on 
one  or  In  a  meeting  environment,  was  chosen  for  this 
research.  This  interface  is  hardcopy. 


The  hardcopy  mixed  graphics  displays  were  generated 
by  building  a  metafile  while  painting  the  display  on  the 


CRT  screen.  A  program  was  written  that  directed  the 
images  to  be  routed  to  the  Versatec  Plotter  where  the 

hardcopies  were  produced.  The  hardcopy  graphics  displays 
were  exact  Images  of  those  given  on  the  CRT  interface. 
Two  complete  sets  of  the  graphic  displays  were  generated. 
The  first  set,  whose  generator  program  Included  the 
intensity  patterns,  were  used  as  the  monochrome  treatment. 
The  second  set  did  not  have  the  legend  squares  or  column 
bars  filled,  just  outlined  in  the  black  ink  as  a  plotter 
supporting  color  was  not  available.  The  four  treatments 
Involving  color  used  this  second  set  as  their  base. 

Color  was  added  to  the  graphics  displays  by  the 

experimenter.  Using  " Rente 1"  felt  tip  pens,  the  legend 
squares  and  column  bars  were  filled  or  outlined  In  the 

appropriate  schematic  as  described  above  for  the  CRT 

colors  (CRT),  the  random  colors  (RND)  and  the  outline 
colors  (OUT).  "Venus"  colored  pencils  were  used  to  create 
the  pastel  (PAS)  treatment.  Appendix  D  Identifies  the 
specific  patterns  and  colors  used  for  each  treatment. 

Organ i zat i on  Context.  The  final  part  of  the 
definition  of  a  DSS  considers  the  organizational  context. 
Mitroff  and  Mason  (1973)  suggested,  based  on  Anthony's 


l 


approach,  one  way  to  incorporate  this  variable  Is  to 


discuss  the  kinds  of  problems  organizations 
typically  "choose”  to  solve.  This  way  says  that 
organizational  structure  Is  a  reflection  of  the 
levels  of  problems  the  organization  considers 
Important  to  solve  (p.  483). 

Of  the  three  types  (strategic  planning,  management 

control,  operational  control)  of  major  problems 

organizations  typically  face,  this  case  study  placed  the 

decision  maker  Into  the  management  control  position.  The 

objective  of  the  decision  maker  was  to  assure  that  the 

proper  Inventory  was  obtained  and  resources  used 

efficiently  and  effectively  in  the  accomplishment  of  the 

organizat Ion's  goal  of  maximizing  total  profit.  This  type 

of  problem  satisfied  Anthony's  definition  of  management 

control.  Therefore,  for  this  research  the  organ  1 zat 1 ona 1 

context  was  considered  a  constant  and  was  defined  as 

managerial  control. 

Performance  Measures 

The  objective  of  this  study  was  to  analyze  how  the 
attribute  of  color,  when  used  functionally  In  computer 
generated  graphical  displays,  affects  the  performance  of  a 
decision  maker.  King  and  Rodriguez  purport  "decision 
performance  assessments  reflect  the  quality  of  the 
decision  making  process  that  Is  supported  by  the  MIS" 


(King  and  Rodriguez,  1978  p.  45).  Quality  of  decision 
making  was  measured  in  two  of  the  traditional  terms  for 
this  research:  time  and  quality  of  decision. 

Time.  The  research  literature  considers  time  to  be 
one  of  the  most  important  of  the  measures  of  performance 
in  either  the  area  of  color  or  DSS.  Time  was  defined  in 
three  ways  for  this  study:  Total  Time,  No  Memory  Time  and 
Memory  Time.  Total  Time  was  the  number  of  seconds  a 
decision  maker  used  when  solving  the  first  three  parts  or 
Interactive  parts  of  the  case  study.  No  Memory  Time  was 
the  number  of  seconds  used  when  the  task  involved  no 
memory  but  only  direct  reading  from  one  or  more  graphs. 
This  was  the  elapsed  time  for  solving  the  first  scenario 
of  the  case  study.  Memory  Time  was  the  amount  of  time 
taken  to  solve  those  parts  of  the  case  study  which 
required  memory  for  some  of  the  decisions.  Memory  Time 
was  measured  as  the  amount  of  time  (seconds)  it  took  the 
decision  maker  to  complete  the  second  and  third  scenarios 
of  the  case  study.  Time  was  not  controlled  in  the  post 
interview  (test)  or  non  Interactive  scenario  of  the 
exper i ment . 

The  experimenter  employed  a  fellow  graduate  student 
to  measure  and  record  the  times.  A  digital  watch  was 
used.  Four  times  were  taken:  start  times  for  the  first 


two  scenarios,  and  end  times  for  the  first  and  third 
scenarios.  The  same  start  time  was  recorded  for  the 
entire  group  when  the  experimenter  signaled  them  to  begin 
the  first  scenario.  At  the  end  of  the  first  scenario,  the 
subjects  were  Instructed,  both  verbally  before  they 
started  as  well  as  In  a  written  direction  after  the  last 
question  of  the  first  scenario,  to  raise  their  hand.  The 
timer  recorded  their  time  when  their  hand  was  raised.  At 
that  time  the  experimenter  took  the  first  scenario  from 
the  subject  and  handed  out  the  next  two  parts.  The 
experimenter  signaled  the  timer  when  the  subject  began  the 
second  scenario.  As  Instructed  for  the  first  part,  the 
subject  raised  their  hand  when  finished  with  the  third 
scenario  which  the  timer  recorded. 

Qua 1 1 tv  of  Dec  1 s I  on .  As  discussed  In  the  literature 
review  chapter,  quality  of  decision  (accuracy)  Is  another 
very  pertinent  measure  of  performance  when  Investigating 
the  Impact  of  color  or  assessing  the  quality  of  a  OSS. 
This  research  measured  accuracy  In  many  different  ways  In 
order  to  determine  if  the  affect  of  color  was  a  function 
of  the  type  of  decision  under  consideration.  The  types  of 
decisions  and  the  questions  used  to  measure  the 

performance  are  defined  below.  Each  operational 


definition  cites  the  questions  which  compose  the  score. 
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They  are  referenced  by  a  number  which  indicates  the 
scenario  and  specific  question  of  that  scenario.  All 
questions  are  listed  in  the  case  study  package  located  in 
Appendix  B.  The  unit  of  measure  for  all  scores  was  per 
cent. 

Total  Score  was  the  sum  of  the  scores  of  al 1  the 
questions  in  the  case  study,  1-1  through  1-9,  2-1  through 

2- 4,  3-1  through  3-7  and  4-1  through  4-11. 

Sum  Score  was  the  sum  of  the  scores  of  the 
interactive  (scenarios  1,  2,  and  3)  questions  of  the  case 
study.  It  included  1-1  through  1-9,  2-1  through  2-4  and 

3- 1  through  3-7. 

Scenario  1  Score  was  the  sum  of  scores  achieved  in 
the  first  scenario,  1-1  through  1-9.  No  memory  was 
required  to  work  this  section  of  the  case  study. 

Scenario  2  and  3  Score  was  the  sum  of  scores  achieved 
In  the  second  and  third  scenarios  (2-1  through  2-4  and  3-1 
through  3-7).  Some  of  these  questions  required  memory  of 
what  was  presented  or  decided  during  the  previous 
scenar 1o(s) . 

Post  Test  Score  was  the  sum  of  the  points  earned  on 
the  fourth  or  interview  scenario  of  the  case  study  (4-1 
through  4-11).  No  new  Information  or  previous  information 
were  presented,  none  of  the  graphic  displays  were 
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available  for  review  and  the  session  was  not  timed. 

Direct  Read  1  Graph  Score  was  the  sum  of  scores  of 
questions  requiring  only  information  read  from  only  one 
graphic  display.  Neither  memory  or  computation  was 

necessary  to  make  the  decision.  This  score  comprised  the 
points  from  Question  1-2,  1-3,  1-6,  1-9  and  2-4.3. 

Direct  Read  2  Graphs  Score  was  the  sum  of  scores  of 
questions  needing  the  data  that  could  be  directly  read  or 
compared  from  two  or  more  graphic  displays.  Neither 
memory  or  computation  was  necessary  to  make  the  decision. 
The  score  was  the  sum  of  points  from  questions  1-1,  1-4, 
2-1 ,  3-1 ,  3-1  and  3-7. 

Extrapolation  1  Graph  Score  was  the  sum  of  scores  of 
questions  requiring  manipulation  of  the  information  read 
from  one  graphic  display.  No  memory  was  necessary.  The 
score  was  the  sum  of  points  from  questions  1-5,  2-3,  3-4 
and  3-5. 

Extrapolation  2  Graphs  Score  was  the  sum  of  points 
from  questions  that  needed  the  computation  of  the 
Information  taken  from  two  or  more  graphic  displays.  No 
memory  was  required.  This  score  was  the  sum  of  the  points 
from  questions  1-5,  1-7  and  3-6. 

Short  Term  Memory  Score  was  the  sum  of  points  from 
questions  in  the  interactive  scenarios  (parts  1,  2,  3) 


that  required  memory  of  previously  presented  Information 
and/or  decisions.  Computation  was  not  necessary.  This 
score  was  the  sum  of  points  from  questions  2-2,  2-4.1, 
2-4.2,  2-4.4,  2-4.5  and  3-3. 

Long  Term  Memory  Score  was  the  sum  of  the  scores  of 
the  post  test  questions  that  required  only  memory  of 
Information  and  decisions  made  during  the  first  three 
(Interactive)  scenarios.  Computation  of  the  Information 
was  not  necessary.  This  score  was  the  sum  of  the  post 
test  questions  4-1,  4-2,  4-7,  4-8,  4-9  and  4-10. 

Long  Term  Memory  Extrapolation  Score  was  the  sum  of 
scores  of  the  post  test  questions  which  needed 
manipulation  of  Information  presented  during  the 
interactive  scenarios.  This  score  was  the  sum  of  points 
from  the  post  test  questions  4-3,  4-4,  4-5,  4-6  and  4-11. 

Each  question  had  a  correct  optimal  answer  since  the 
problem  was  determlnl st ic  in  nature  with  the  overall 
objective  of  maximizing  profit.  This  correct  answer, 
which  was  always  one  of  the  available  options,  was  worth 
five  points.  Some  of  the  multiple  choice  decisions  had 
available  options  that  were  close  to  the  correct  answer. 
These  answers  were  worth  three  points  because  even  though 
the  decision  was  not  optimal.  It  was  close.  Significantly 
wrong  answers  were  worth  zero  points.  All  questions  were 


weighted  equally  in  the  different  scores. 


The  experiment's  procedures  and  instrument  were 
pretested  to  confirm  the  timing  of  the  experimental 
session,  the  robustness  of  the  computer,  the  clarity  of 
instruct  ions,  the  understanding  of  the  graphical  legend 
and  charts  and  the  Interpretation  of  the  case  study 
questions  and  answers. 

Ten  volunteers  from  a  summer  graduate  engineering 
decision  analysis  course  partook  in  the  pretesting 

experiment.  Each  of  the  ten  sessions  was  conducted  as  if 
It  was  the  actual  experimental  session  with  the  exception 
of  three  factors:  the  human-machine  interface,  the 

environmental  setting  and  the  post  experiment  Interview. 
Procedures 

Each  subject  was  asked  to  read  and  sign  the  volunteer 
consent  form  which  was  written  in  accordance  with  the 
guidelines  set  by  the  Council  on  Human  Experimentation. 
The  consent  form  (Appendix  E)  delineated  the  name  of  the 
experimenter,  the  reason  behind  the  research,  the  purpose 
and  any  risks  Involved  with  participating  in  the  study, 
the  notification  of  withdrawal  without  penalty,  and  the 
voluntary  consent  statement  and  signature  block. 


The  subjects  were  then  requested  to  fill  out  the 
Background  and  Experience  quest lonnal re.  After  each  had 
completed  It,  they  were  Instructed  by  the  experimenter  to 
read  the  Instructions  and  the  first  scenario  of  the  case 
study  which  were  provided  In  hardcopy. 

They  Indicated  to  the  experimenter  when  they  were 
finished  reading  and  ready  to  begin  the  Interactive 
decision  making  part.  The  experimenter  then  verbally 
outlined  the  Instructions  again,  defined  the  terms 
revenue,  cost,  overage,  shortage,  profit  and  loss,  and 
emphasized  the  decisions  were  based  solely  on  the 
information  provided  In  the  case  study's  graphic  displays 
and  text.  The  experimenter  also  reminded  the  subject  that 
once  a  scenario  was  completed  its  Information  and 
decisions  were  not  available  for  review,  answers  could  not 
be  changed  and  the  decisions  in  a  particular  scenario 
could  be  answered  In  any  order.  After  the  experimenter 
answered  any  questions,  the  subject  was  told  to  begin. 

Since  pretesting  was  accomplished  during  the  period 
when  the  CRT  was  still  the  projected  mode  of  display,  it 
became  the  human-machine  Interface.  Both  the  monochrome 
Genesco  and  Imlac  graphic  terminals  were  available  and 
supported  the  program  and  therefore  used.  These  were 
located  at  the  ASU  Computing  Services  Computer  Graphics 
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Site.  This  location  was  also  planned  for  the  actual 
exper 1 ment . 

When  the  experimenter  told  the  subject  to  begin,  a 


key  was 

struck  on 

the 

keyboard 

which 

1 n 1 t 1 ated 

the 

1 nternal 

computer 

clock 

The 

clock 

f unct 1 on 

was 

programmed 

1 nto  the 

case 

study  at 

specific  points 

to 

collect  and  print  out  the  times  needed  for  the  analysis. 
By  the  way  the  clock  was  initiated  and  stopped,  it 
measured  the  elapsed  time  the  same  for  all  subjects. 

The  subject  worked  the  first  scenario  by  referring  to 
the  graphic  displays  that  were  accessible  through  a  menu 
selection  on  the  CRT  screen.  Repeated  viewing  of  any 
graphics  display  pertaining  to  the  scenario  was  an  option. 
A  built-in  counter  was  programmed  in  the  software  that 
determined  and  printed  the  number  of  graphics  displays 
viewed  during  the  scenario.  If  the  CRT  was  the  interface, 
the  number  of  graphics  displays  viewed  would  have  been  a 
performance  variable  of  Interest. 

When  the  subject  selected  the  "Finished  with  this 
analysis"  menu  option,  the  internal  clock  marked  time  and 
Instructions  to  go  on  to  the  next  scenario  were  given. 
The  subject  read  the  new  case  study  facts  from  hardcopy, 
viewed  the  updated  graphic  displays  of  his/her  choice  via 
menu  selection  and  made  decisions  that  were  In  the  form  of 
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multiple  choice  and  true/false  questions  given  in  hardcopy 
along  with  the  case  study  text.  This  process  was  repeated 
for  the  third  scenario.  When  the  subject  was  finished  and 
selected  the  ’’Finished  with  this  analysis"  option,  the 
internal  clock  marked  time  and  the  program  printed  the 
requested  times  and  total  number  of  graphic  displays 
reviewed.  The  subject  was  instructed  to  request  from  the 
experimenter  the  "regional  manager's  interview"  or  post 
test. 

This  post  test  was  In  hardcopy  and  in  the  same  format 
as  previous  decisions.  Since  the  post  test  was  based  on 
information  given  during  the  first  three  scenarios  and  not 
on  new  facts,  it  did  not  have  graphical  information 

displays  for  reference.  Therefore  the  subject  was  asked 
to  move  to  the  free  desk  next  to  the  CRT  to  complete  the 
post  test.  The  experimenter  stressed  to  the  subject  that 

accuracy  of  the  decisions  and  not  speed  was  Important. 
The  post  test  was  not  timed.  When  the  subject  was 
finished  the  experimenter  asked  him/her  to  stay  for  a  post 
experiment  interview. 

While  the  subjects  were  completing  the  post  test,  the 
experimenter  scored  the  other  parts.  After  they  were 
finished  with  the  post  test,  the  experimenter  interviewed 


them  personally,  one  on  one.  The  Interview  consisted  of 
asking  them  If  they  encountered  any  difficulty  when 
Interfacing  with  the  computer  via  the  CRT  screen  and  key 
board.  Did  they  have  any  problems  In  understanding  any  of 
the  written  or  verbal  instructions  or  scenario  text?  They 
were  also  requested  to  Interpret  a  graphic  display's 
legend  and  charts  which  were  presented  In  front  of  them  in 
hardcopy.  The  experimenter  chose  several  questions  from 
those  they  answered  and  asked  them  to  explain  what  the 
questions  meant  to  them  and  how  they  arrived  at  the 
answers.  These  questions  Included  all  they  had  answered 
incorrectly  as  well  as  those  with  which  others  had  a 
problem.  Any  other  comments  were  welcomed.  The  subjects 
were  thanked  for  their  time  and  efforts  and  asked  not  to 
discuss  the  experiment  with  anyone. 

Results 

Pretesting  was  concerned  with  timing,  easiness  of 
working  with  the  CRT  and  keyboard,  clarity  of  instructions 
and  text.  Interpretation  of  graphical  displays  and 
questions  and  difficulty  of  the  questions.  The  data  and 
comments  from  the  10  subjects  were  reviewed  which  resulted 
In  several  changes  In  the  proposed  instrument. 

Timing  of  the  experimental  session  was  targeted  at 
approximately  45  minutes.  Less  time  may  not  have  allowed 


the  subject  to  be  absorbed  Into  the  case  study  thus  giving 
It  total  concentration.  Longer  time  may  have  resulted  In 
lost  Interest  toward  the  end  of  the  experiment  by  the 
subject*  yielding  no  or  Invalid  data.  Pretesting  times 
Indicated  an  average  of  55  minutes.  It  was  decided  to 
delete  four  questions  that  not  only  took  time  but  were 
also  found  difficult  to  Interpret  by  several  of  the 
subjects. 

All  subjects,  whose  attitudes  toward  Interfacing  with 
the  computer  ranged  from  very  uncomfortable  to  very 
comfortable,  reported  the  menu-driven  program  very  easy  to 
control.  They  commented  the  Instructions  were  clear  and 
the  operation  simple.  Several  said  they  were  distracted 
by  others  entering,  conversing  and  leaving  the  room.  To 
control  for  this,  the  terminal  used  for  the  experiment  was 
planned  to  be  put  In  the  far  corner  where  it  was  quieter 
and  the  subject's  back  could  face  the  door. 

There  did  not  appear  to  be  any  difficulty  reading  and 


1 nterpret 1 ng  the 

graph  leal 

displays. 

Several 

of 

the 

subjects  did  not 

real Ize 

at  first 

that  al 1 

of 

the 

questions  were  based  on  the  information  given  In  the  text 
and  on  the  graphical  displays.  However,  most  came  to  that 
realization  about  half  way  through  the  first  scenario.  In 
efforts  to  eliminate  this  problem,  the  hardcopy 
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Instructions  on  this  point  were  rewritten  for  clarity  and 
the  experimenter  verbalized  this  Instruction  twice  In  the 
presentation  during  the  actual  experiment. 

Six  questions  were  consistently  missed  among  the 
subjects.  From  their  comments,  four  of  these  questions 
were  deleted  because  of  the  confusion  factor.  The  other 
two  questions  were  retained.  They  were  answered 
Incorrectly  not  because  of  misinterpretation  but  because 
of  level  of  difficulty.  They  were  among  the  most 
challenging  of  questions.  In  fact,  all  who  missed  them, 
when  asked  how  they  Interpreted  them  and  arrived  at  their 
answers,  realized  their  mistake  without  any  Input  from  the 
experimenter . 

The  difficulty  of  the  questions  were  analyzed  by 
scoring  the  10  sets  of  case  studies  and  figuring  the 
average  of  each  scenario  by  Itself  and  In  total.  The  four 
deleted  questions  were  not  Included.  The  averages 
calculated  ranged  from  80%  to  91%.  This  was  higher  than 
the  population  targeted  average  of  75%.  However  the 
higher  scores  were  expected  because  of  the  pretesting 
sample  used.  They  were  all  graduate  students  currently 
enrolled  In  a  graduate  decision  analysis  course  that  was 
leading  up  to  covering  a  similar  problem  type.  Those 
subjects  were  getting  Into  that  mode  of  thought  prior  to 
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being  exposed  to  the  experiment.  Too,  all  of  them  partook 
In  the  experiment  within  one  hour  after  their  two  hour 
decision  analysis  class.  Based  on  this  analysis  it  was 
determined  the  questions  were  at  the  proper  difficulty 
level  that  would  yield  data  analysis  that  would  be 
meaningful  to  the  research  question. 

During  the  pretesting  of  the  procedures  and 

instrument,  the  Tektronlcs  4027  color  terminal  became 
available  for  hardwiring  into  the  Harris  800  mainframe. 
It  was  then  the  problems  were  encountered  and  researched 
as  previously  discussed  under  Design  of  Study  Interface. 
When  It  was  decided  to  use  hardcopy  as  the  human-machine 
Interface,  several  changes  had  to  be  made  to  the 

Instrument.  The  text  Instructions  were  rewritten  to 

reflect  the  use  of  hardcopy  instead  of  the  CRT  interface. 
These  changes  were  minor.  Pretesting  was  accomplished  by 
recontacting  three  of  the  subjects  and  asking  their 

interpretation  of  the  instructions.  No  difficulty  was 
encountered.  Too,  hardcopy  graphic  displays  had  to  be 
computer  generated  with  the  color  treatment  added  where 
appropriate.  The  hardcopy  graphic  displays  did  not  have 
to  be  pretested  as  they  had  already  been  used  in  the 
original  pretesting  session  for  interpretation  analysis. 

After  all  the  changes  were  incorporated  into  the 


design,  it  was  determined  to  proceed  with  the  laboratory 
experiment.  Analysis  of  the  design  and  the  pretesting 
Information  gave  confidence  that  the  experiment  would 
yield  data  which,  when  analyzed  properly,  would  allow  the 
researcher  to  gain  Insight  on  the  research  question. 

Experimental  Procedures 

Int reduction 

The  same  procedures  were  followed  when  conducting  the 
experiment  as  were  suggested  during  the  pretesting  phase 
with  several  additions.  The  subjects  were  selected  and 
scheduled  for  the  experiment,  the  treatments  were 
completely  randomized  In  their  assignment  to  the  subjects, 
the  subjects  were  tested  for  color  deficiency,  and  the 
experiment  run. 

Sample  Selection 

The  experimenter  visited  six  of  the  engineering 
prefixed  courses  that  were  taught  during  the  summer 
sessions,  1983  at  ASU.  The  experiment  was  defined  as  a 
case  study  that  involved  making  decisions  based  on 
Information  provided  on  graphic  displays.  It  was 
emphasized  that  the  only  prerequisites  were  that  they  be 
enrolled  In  the  engineering  course  and  would  volunteer  an 
hour  and  a  half  of  their  time  and  effort.  No  special 


skills  or  knowledge  was  necessary.  It  was  open  to  all  and 


incentive  homework  points  were  offered  by  five  of  the 


instructors.  The  treatments  (monochrome  and  color)  were 
not  mentioned.  A  signup  sheet  was  passed  around  for 
volunteers  to  schedule  a  time  convenient  for  them.  In  all 
of  the  classes,  over  90%  volunteered. 

Scheduled  Environment 

The  experiment  was  administered  at  eight  different 
times,  four  times  each  in  the  morning  and  afternoon. 
Three  different  rooms  were  used  as  the  environment,  each 
used  at  least  once  In  the  morning  and  in  the  afternoon. 
One  of  the  rooms  was  a  classroom,  the  second  a  lecture 
hall  classroom  and  the  third  a  graphics  laboratory 
classroom.  All  rooms  had  flat  tabletop  desks.  The 
classroom  and  lecture  hall  desks  were  29  Inches  high.  The 
chairs  In  these  two  rooms  were  identical  and  were  the 
contoured  backed  chairs  with  seat  height  17.25  inches  from 
the  floor.  The  laboratory  desks  were  drafting  tables, 
which  when  the  top  was  in  the  horizontal  position  as  it 
was  during  experimentation,  it  was  37  Inches  high.  A 
backless  stool  was  used  whose  seat  was  26  inches  from  the 
floor.  The  lighting  was  440  LUX  (morning)  and  500  LUX 
(afternoon)  in  the  classroom,  and  a  constant  430  and  460 
LUX  respectively  in  the  windowless  lecture  hall  and 
graphics  laboratory.  The  noise  level  in  each  setting  was 


only  that  made  by  the  participants  themselves  and  the 


experimenter.  A  quiet  study  environment  was  maintained. 
Signs  posted  on  the  rooms'  doors  were  successful  In 
deterring  outside  distractions.  The  only  major 
distractions  In  the  rooms  were  the  movements  of  the 
experimenter  and  those  of  the  subjects  when  leaving  after 
they  had  finished. 

Randomized  Treatments 

Two  techniques  were  used  in  attempts  to  randomize  the 
treatments  among  the  sample,  yet  at  the  same  time  trying 
to  achieve  a  balanced  design  and  control  for  exogenous 
variables.  The  signup  system  allowed  the  experimenter  to 
estimate  how  many  to  expect  at  each  session.  To  insure  an 
ample  and  an  equal  number  of  treatments  were  available  at 
each  experimental  session,  three  or  more  was  added  to  the 
signup  number  to  make  It  a  multiple  of  five.  It  was 
divided  by  five  to  determine  how  many  of  each  treatment 
were  needed.  Using  a  random  number  table,  the  treatments 
were  ordered  and  assigned  a  sequential  number.  These 
treatments  were  placed  blank  side  up  in  sequential  order 
on  every  other  desk  In  the  experiment  room,  starting  from 
the  speakers  right  to  left,  front  to  back.  The  rooms  were 
vacant  when  the  treatments  were  distributed.  The  subjects 
arrived  and  sat  at  any  vacant  desk  that  had  a  case  study 


on  It.  The  experimenter  Insured  none  of  the  subjects 
touched  the  Instruments  before  being  told  to  do  so.  A 
post  experimental  survey  of  a  random  sample  from  each 
session  Indicated  the  subjects  were  unaware  of  the 
different  treatments  both  prior  to  and  after  completing 
the  experiment.  Based  on  these  two  procedures,  it  is 
assumed  that  every  subject  had  a  equal  chance  of  receiving 
any  one  of  the  treatments,  which  according  to  Kendall  and 
Stewart  (1971)  satisfies  the  principle  of  randomization  of 
treatments. 

This  process  Insured  that  at  each  of  the  eight 
sessions  approximately  the  same  number  of  each  treatment 
was  administered.  This  allowed  the  experimenter  to 
control  for  exogenous  variables  that  may  have  otherwise 
biased  the  Inferences  gained  from  the  data.  The  key 
exogenous  variables  of  concern  were  competition,  verbal 
Instructions  and  subject  questions,  courses  from  which  the 
subjects  came,  desk  type,  noise,  lighting,  time  of  day, 
day  of  week  and  month. 

A  competitive  atmosphere  often  exists  In  a  classroom 
In  a  testing  environment.  This  had  a  potential  of 

affecting  the  performance  of  a  worker  In  different  ways. 
No  matter  what  the  nature  of  the  competitive  affect,  it  is 
assumed  It  did  not  favor  any  particular  treatment  since 


each  type  were  represented  In  approximately  equal 
quantities. 

The  written  instructions  were  controlled  but  the 
verbal  ones  may  have  varied  each  time  they  were  given  by 
the  experimenter  even  though  the  same  written  outline  was 
used.  Even  when  the  exact  words  were  verbal ly  read. 
Inflections  of  certain  Instructions  had  a  potential  of 
affecting  the  outcome  of  the  experiment.  Too,  after  the 
verbal  directions  were  given,  the  experimenter  entertained 
all  questions  which  In  themselves  could  have  influenced 
the  outcome.  Again  since  all  treatments  were  administered 
in  approximately  equal  numbers  in  each  session,  it  is 
assumed  this  factor  did  not  bias  any  one  treatment. 

Most  sessions  were  scheduled  to  be  convenient  for  the 
volunteers,  which  usually  meant  right  before  or  after 
their  classes.  Therefore  the  majority  of  subjects/session 
came  from  the  same  class.  To  wash  out  any  bias  that  may 
have  arisen  from  this  homogeneity  of  the  sample,  the  five 
treatments  were  administered  In  equal  numbers.  The 
objectives  of  this  research  did  not  include  generalizing 
on  a  population  of  engineers  dt scriminated  by  specific 
coursework  background. 

Like  the  particular  course  experience,  the  desk  type, 
noise,  calendar  date,  day  and  time  of  day  were  not 
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considered  variables  of  interest  to  this  research. 
However,  logistics  did  not  allow  these  factors  to  be 
considered  or  assumed  constants.  Therefore,  by  insuring  a 
balance  of  treatments  under  each  condition,  it  is  assumed 
that  these  factors,  though  varied,  showed  no  favor  to  any 
one  or  more  of  the  five  treatments. 

Color  Deficiency 

As  the  subjects  selected  a  seat  for  the  experiment, 
the  experimenter  screened  each  for  potential  color 
deficiency.  If  a  subject  missed  three  or  more  of  the 
patterns  shown  in  the  plates  of  the  Dvorine 
Psuedo- I sochromat i c  Test,  they  were  asked  to  forego 
participating  in  the  experiment,  given  class  incentive 
points  and  requested  to  leave.  Nine  males  and  zero 

females,  which  is  approximately  proportional  to  the 
general  population  (DeMars,  1975)  were  asked  to  leave. 
There  was  never  more  than  two  in  a  session  who  did  not 
identify  the  required  number  plates. 

Experimental  Procedures 

After  al 1  subjects  were  seated,  had  been  tested  for 
potential  color  deficiency,  and  completed  the  Volunteer 
and  Background  and  Experience  forms,  the  formal  experiment 
began.  Instructions  were  given;  the  first  scenario 
worked,  timed  and  collected;  the  next  two  scenarios  given. 


completed,  timed  and  collected;  and  finally  the  post  test 
taken. 

Time  was  allotted  for  each  subject  to  read  the 
Instructions  and  first  scenario  of  the  case  study.  After 
all  had  done  this,  the  experimenter  verbalized  the 
instructions,  defined  the  terms  revenue,  cost,  overage, 
shortage,  profit  and  loss,  and  emphasized  the  decisions 
were  based  solely  on  the  information  provided  in  the  case 
study.  All  questions  were  then  entertained.  After  the 
last  was  answered,  the  experimenter  again  reminded  the 
subjects  that  the  decisions  were  based  only  on  the 
information  provided;  the  scenarios  built  on  each  other 
but  were  not  available  for  review  once  completed,  and 
decisions  could  be  made  in  any  order  but  once  marked  could 
not  be  changed.  She  also  stressed  that  as  soon  as  they 
finished  the  first  scenario  they  were  to  raise  their  hand 
because  timing  was  Important  to  the  study.  The  group  was 
then  directed  to  turn  over  the  case  study  packet  located 
in  the  upper  left  of  their  desk  and  begin.  The  recorder 
marked  the  start  time. 

As  the  subjects  worked,  the  experimenter  and  timer 
both  scanned  the  room  to  insure  that  each  were  using  the 
graphics  in  their  activities.  Since  this  direction  did 
not  appear  clear  during  the  pretesting  phase,  the 


experimenter  wanted  to  Insure  it  was  for  the  experiment. 
During  all  the  sessions,  it  was  not  apparent  to  either  the 
experimenter  or  the  timer  that  any  of  the  subjects 
misunderstood  this  direction. 

The  timer  continually  scanned  the  room  for  those  who 
finished.  His  objective  was  to  mark  the  clock  time  when 
the  subject  raised  their  hand  indicating  the  scenario  was 
done.  The  timer  monitored  for  time  as  well  as  any 
indications  that  a  subject  was  finfshed  but  did  not  follow 
the  directions  concerning  raising  their  hand.  Three 
subjects  fell  into  this  category,  which  the  timer  caught 
and  recorded  the  clock  time  after  each  looked  up  and 
before  they  started  gazing  and  wondering  what  to  do  next. 
With  only  about  15  subjects  in  each  session,  the  timer 
located  himself  so  he  could  see  all  the  subjects  papers 
without  moving  about  the  room.  It  is  assumed  since  the 
sessions  were  small,  the  same  timer  and  his  digital  watch 
used,  and  the  same  procedures  followed,  the  timing  was 
consistent  over  all  treatments  and  sessions.  If  it  was 
not,  the  inconsistencies  were  random  over  the  treatments. 

When  a  subject  raised  thefr  hand  indicating  the  end 
of  the  first  scenario,  the  timer  recorded  the  time  and  the 
experimenter  went  to  the  front  of  the  subject's  desk.  She 
took  the  first  scenario's  text  and  graphics  displays,  and 


reviewed  the  pertinent  instructions  for  the  next  two 
scenarios  In  a  quiet  one-on-one  conversation.  The  second 
scenario  was  handed  face  down  to  the  subject  and  the  third 
part  laid  in  the  upper  left  corner  of  their  desk.  When 
the  subject  turned  the  second  scenario  over  to  work  it, 
the  experimenter  signaled  to  the  timer  to  record  the  start 
time. 

The  subject  was  instructed  to  work  through  the  second 
scenario  and  when  finished,  lay  it  (text,  decisions  and 
graphics)  face  down  on  the  empty  desk  adjacent  to  them. 
They  were  to  continue  right  on  to  the  third  scenario  that 
was  located  at  the  top  left  corner  of  their  desk.  From 
the  observations  made  by  both  the  experimenter  and  timer, 
no  one  had  difficulty  following  these  instructions. 

As  for  the  first  scenario,  each  subject  was  directed 
in  writing  and  verbally  to  raise  their  hand  when  finished 
with  the  third  scenario.  The  timer  followed  the  same 
procedures  in  determining  the  time  to  record  and  marked  it 
as  he  did  for  the  first  scenario.  For  this  time,  every 
subject  appeared  to  follow  the  requested  directions 
concerning  raising  their  hand. 

When  the  subject  finished  the  Interactive  part  of  the 
case  study,  the  experimenter  approached  the  front  of  the 
desk.  She  cleared  the  desk  of  everything  and  handed  the 


112 


E 


l« 


subject  the  post  test.  The  subject  was  verbally  told  the 
post  test  was  based  on  the  previous  information  and 
decisions.  This  part  was  not  timed  and  accuracy  or 
quality  of  decision  was  the  important  concern.  Each  were 
thanked  for  their  generous  part icipation  and  were  asked  to 
leave  quietly  after  they  finished  the  decisions  and  turned 
over  the  paper  on  their  desk.  They  were  reminded  not  to 
discuss  the  experiment  with  anyone. 

The  experimenter  randomly  selected  three  from  each 
session  to  interview  after  the  experiment.  She  did  this 
to  discover  if  any  had  knowledge  of  the  key  treatments 
being  tested  either  before  of  after  the  experimental 
session.  Twenty-three  were  unaware  their  graphics 
displays  were  different  than  others  in  the  room.  One 
noticed  on  his  way  out  that  another  had  colored  graphics 
displays  while  his  were  black  and  whfte. 

After  each  session  the  experimenter  checked  through 
each  turned- In  case  study.  This  was  done  only  to 
determine  If  all  the  questions  in  each  part  were  answered, 
not  for  scoring.  Only  one  problem  was  encountered.  One 
subject  only  completed  the  first  two  questions  of  the  post 
test.  Since  this  did  not  occur  during  the  last  session. 


the  data  were  considered  invalid,  were  not  used  and  the 
subject  was  not  counted  against  the  total  needed  for  that 


treatment  type.  From  a  quick  scan  of  the  case  studies 
from  the  last  session,  which  brought  the  total  to  120,  the 
data  collection,  thus  the  experimental  phase  of  this 
research,  was  finished. 


IV.  ANALYSIS  AND  RESULTS 


I ntroduct Ion 

The  objective  of  this  research  was  to  gain  insight  on 
how  the  attribute  of  color,  when  used  functionally  in 
computer  generated  mixed  graphics  displays,  affects  the 
performance  of  a  decision  maker  who  has  an  engineering 
background.  A  laboratory  experiment  was  created  that 
simulated  a  decision  making  environment  in  an  attempt  to 
gather  data  that,  when  analyzed  properly,  would  lend 
insight  to  the  problem.  This  chapter  focuses  on  the 
analysts  portion  of  the  research.  Its  development  follows 
the  chronological  order  of  the  types  of  analysis  done. 

The  first  section  discusses  the  analysis  of  the 
problem  statement  which  resulted  in  defining  hypotheses 
which,  when  tested  statistically,  would  satisfy  the 
objective  of  this  research.  This  section  delineates  all 
the  formal  hypotheses  considered  in  this  project.  The 
second  section  presents  the  analysis  techniques  others 
have  employed  when  engaged  in  similar  types  of  research. 
Since  the  analytic  tools  selected  depended  not  only  on  the 
hypothesis,  but  also  on  the  properties  of  the  data  used  to 
test  the  hypothesis,  the  criteria  underlying  the  tools  are 
then  examined  In  the  third  section  in  light  of  both  the 
hypotheses  and  data.  Based  on  the  degree  of  satisfaction 
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of  the  criteria,  the  appropriate  analytic  tools  are 
identified  and  applied  to  the  data.  The  results  are  given 
and  conclusions  offered  for  all  hypotheses  under 
consideration  in  the  fourth  section.  The  chapter 
concludes  with  the  results  and  conclusions  derived  from 
the  overall  trends  in  the  data. 

Hypotheses 

The  problem  statement  was  broken  down  into  many  parts 
and  rewritten  in  a  form  that  was  conducive  to  rigorous 
objective  analysis.  Each  part,  or  hypothesis,  narrowed 
the  general  problem.  This  allowed  an  in-depth  study  on 
different  aspects  of  the  problem  as  well  as  an  overall 
analysis  of  the  problem  when  considering  all  the  results 
as  a  whole.  This  multi -hypotheses  approach  gives  strength 
to  the  conclusions  derived  from  the  results. 

The  performance  of  the  decision  maker  with  an 
engineering  background  Is  what  was  measured  during  the 
experiment.  Performance  was  defined  in  terms  of  time  to 
make  decisions  and  quality  or  accuracy  of  decisions. 
Three  different  times  were  defined:  total  time  to 

accomplish  the  interactive  part  of  the  case  study,  time  to 
complete  the  first  scenario  which  did  not  require  any 


memory,  and  time  to  work  those  graphics  supported 
scenarios  that  mandated  memory  to  make  many  of  the 
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dec  I s ions . 

Quality  of  decision  was  the  other  quantitative 
measure  of  performance.  It  was  subdivided  into  12  types, 
each  of  which  was  operationally  defined  in  the 
Performance-Accuracy  section  of  the  previous  chapter  and 
are  defined  again  in  the  Performance  Measure  Selection 
section  of  this  chapter.  They  included  total  score  of  the 
case  study,  sum  score  of  the  Interactive  parts,  no  memory 
interactive  score,  memory  interactive  score,  post  test 
score,  direct  read  from  one  graph  score,  direct  read  from 
more  than  one  graph  score,  extrapolation  from  one  graph 
score,  extrapolation  from  more  than  one  graph  score,  short 
term  memory  score,  long  term  memory  score  and  long  term 
memory  extrapolation  score. 

Hypotheses  were  formulated  that  incorporated  each  of 
the  performance  variables  with  every  combination  of 
treatment  and  characteristic  factor.  These  are  listed  in 
Table  4.1,  Research  Formal  Hypotheses.  Each  hypothesis 
is  two  sided  with  the  equality  of  performance  in  the  null 
(Ho)  and  the  inequality  in  the  alternate  (HI)  hypothesis. 
Subsequently  only  the  null  hypotheses  are  stated,  with  the 
alternatives  implied.  The  general  mathematical  form  is 

H(k)os  u(1)  *  u(J)  ViJ,  i =CRT ,  RND ,  OUT,  PAS,  B/W 

J=CRT ,  RND,  OUT,  PAS,  B/W 
k=Characteristf c  Factor 


H(k) 1 :  u( i )  *  u( j) 


for  at  1  east  one  i , j 
k  Constant 


where 

u  =  average  performance  of  those  using  a  particular 
mixed  graphics  display  treatment.  Performance  was 
measured  in  15  ways.  3  of  which  were  a  function  of 
time,  12  a  function  of  decision  quality. 

i.j  »  the  particular  graphics  treatment.  The  five 
treatments  under  study  included 

CRT  Standard  CRT  color  solid  filled  graphics 
RND  Random  selection  of  available  CRT  color 
solid  filled  graphics 
OUT  Standard  CRT  color  outlined  graphics 
PAS  Pastel  color  solid  filled  graphics 
B/W  Random  selection  of  available  monochrome 
intensity  pattern  graphics 

k  =  characteristic  factor  whose  effect  on  the 

performance  was  accounted  prior  to  testing  the 
hypothesis.  There  were  seven  characteristic 
factors,  each  subdivided  into  several  levels. 

These  factors  Included  age,  sex,  college 
experience,  education  level,  CRT  experience, 
attitude  toward  Interfacing  with  a  computer  and 
video  game  experience. 


This  generic  hypothesis  was  therefore  tested  for  15 
alternative  measures  of  performance  and  7  characteristics 
factors.  This  represented  105  different  tests.  Table  4.1 
specifically  defines  the  null  hypothesis  for  each  of  the 
f orms . 


Literature's  Analytic  Approach 


The 

two 

bas i c  ana  lytic 

tool  3 

employed  by  those 

reporting 

their 

experimental 

studies 

on  the  effects 

of 

color  in 

the 

literature  were 

a  form 

of  the  Analysis 

of 

Variance  (ANOVA)  and  the  t-test  (Christ  etal.,  1977; 


Table  4.1 


RESEARCH  FORMAL  HYPOTHESES 

H(k)o:  u(1)  *  u(j)  Vl.J  UCRT,  RND,  OUT,  PAS,  B/W 

J =CRT , RND , OUT , PAS , B/W 
^Characteristic  Factor 

H(k)l:  u(l)  i  u(j)  for  at  least  one  i,j  k  Constant 


TIME 


Total  Time  (TT) 

H(k)o:  u (TT  of  CRT)*u(TT  of  RND) =u(TT  of  OUT)=u(TT  of  PAS) 
=u(TT  of  B/W) 

No  Memory  Time  (NT) 

H(k)o:  u ( NT  of  CRT)*u(NT  of  RND)=u(NT  of  OUT)=u(NT  of  PAS) 
=u(NT  of  B/W) 

Memory  Time  (MT) 

H(k)o:  u(MT  of  CRT)=u(MT  of  RND)=u(MT  of  OUT)=u(MT  of  PAS) 
=u ( MT  of  B/W) 


QUALITY  OF  DECISION 
Total  Score  (TS) 

H(k)o:  u(TS  of  CRT)=u(TS  of  RND)=u(TS  of  OUT)=u(TS  of  PAS) 
=u(TS  of  B/W) 

I  nt eract I ve  Score  (IS) 

H(k)os  u( IS  of  CRT)=u( IS  of  RND)=u(IS  of  OUT)=u(IS  of  PAS) 
*u( IS  of  B/W) 

No  Memory  Interactive  Score  (NS) 

H(k)o :  u(NS  of  CRT)*u(NS  of  RND)=u(NS  of  OUT)=u(NS  of  PAS) 
=u (NS  of  B/W) 

Memory  Interactive  Score  (MS) 

H(k)o:  u( MS  of  CRT)*u(MS  of  RND)*u(MS  of  OUT)=u(MS  of  PAS) 
=u(MS  of  B/W) 


Post  Test  Score  (PS) 

H(k)o:  u(PS  of  CRT) =u(PS  of  RND)=u(PS  of  OUT)=u(PS  of  PAS) 
*u(PS  of  B/W) 

OIrect  Read  1  Graph  Score  (Dl) 

H(k)o:  u(Dl  of  CRT)*u(Dl  of  RND)»u(Dl  of  OUT)*u(Dl  of  PAS) 
*u(Dl  of  B/W) 
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Tabl e  4.1  (Cont . ) 


RESEARCH  FORMAL  HYPOTHESES 


Direct  Read  2  Graphs  Score  (D2) 

H(k)o:  u(D2  of  CRT)=u(D2  of  RND)=u(D2  of  0UT)=u(D2  of  PAS) 
*u(D2  of  B/W) 

Extrapolation  1  Graph  Score  (El) 

H(k)o:  u(E 1  of  CRT)=u(El  of  RND)=u(El  of  OUT)=u(El  of  PAS) 
»u(E 1  of  B/W) 

Extrapolation  2  Graphs  Score  (E2) 

H(k)o:  u(E2  of  CRT)=u(E2  of  RND)=u(E2  of  0UT)=u(E2  of  PAS) 
=u(E2  of  B/W) 


Short  Term  Memory  Score  (SM) 

H(k)o:  u(SM  of  CRT)=u(SM  of  RND)=u(SM  of  OUT)=u(SM  of  PAS) 
=u (SM  of  B/W) 

Long  Term  Memory  Score  (LM) 

H(k)os  u ( LM  of  CRT)=u(LM  of  RND)®u(LM  of  OUT)=u(LM  of  PAS) 
*u (LM  of  B/W) 


Long  Term  Memory  Extrapolation  Score  (LE) 

H(k)o:  u (LE  of  CRT)=u(LE  of  RND) =u( LE  of  OUT)=u(LE  of  PAS) 
*u(LE  of  B/W) 
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Hitt,  1961;  Hoisman  et  al . ,  1963;  Kanarick  and  Petersen, 
1971;  Munns,  1968;  Schutz,l961;  Smith,  1963;  Smith  and 
Thomas,  1964;  Smith  et  al.,  1965;  Saenz  and  Riche,  1974; 
Tull  is,  1981;  Wedel 1  and  Alden,  1973).  The  underlying 
assumptions  which,  when  satisfied,  give  these  analytic 
tools  their  validity,  were  very  seldom  addressed  in  any  of 
the  literature  findings.  The  ANOVA  or  t-test  was  applied 
to  the  data,  the  statistics  generated  and  tested  for 
significance  at  the  .05  significance  level,  with 
conclusions  and  generalizations  offered.  For  most  of  the 
studies,  the  choice  of  analytic  tool  was  based  solely  on 
the  hypothesis  under  scrutiny  without  consideration  of  the 
properties  of  the  data. 

In  the  literature  when  both  time  and  quality  of 
decision  were  of  concern,  the  most  common  procedure  was  to 
treat  these  measures  of  performance  as  separate  variables 
without  detailing  Justification.  However  in  Hitt's  (1961) 
study  of  the  relative  effectiveness  of  using  selected 
abstract  coding  in  a  map  reading  task,  he  combined  them, 
using  decision  accuracy/time  as  the  measure  of 
performance.  Hitt  reported  two  main  reasons  for  doing 
this.  First,  the  product  moment  correlation  between 
accuracy  and  speed  was  -.85.  Based  on  this,  he  concluded 
the  two  measures  were  highly  linearly  correlated.  The 
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second  reason  Hitt  gave  was  that  this  transformation  was 
needed  to  achieve  a  normal  distribution  of  the  data  and 
homogeneity  of  variance.  When  each  measure  was  plotted 
alone,  each  formed  a  highly  skewed  distribution.  This 
transformation  resulted  in  an  1 nter pretab 1 e  variable  whose 
data  satisfied  the  conditions  underlying  the  ANOVA 
technique,  which  Hitt  used. 

Hoisman  et  a  1.  (1963)  Investigated  the  effects  of 
color  coding  on  subject  performance  when  completing  six 
categories  of  map  reading  tasks.  They  also  examined  the 
relationship  between  time  and  accuracy.  However  their 
correlation  analysis  of  their  pilot  as  well  as  actual 
studies'  data  suggested  no  significant  correlation.  Based 
on  this,  they  assumed  the  two  performance  variables 
Independent,  thus  analyzed  each  separately.  The  remaining 
of  the  reports  examined  time  and/or  accuracy  separately 
without  Inferring  a  Just  If  1  cat  Ion. 

Congruent  with  the  research  and  mathematical 
literature,  the  analytical  techniques  used  in  the  study 
reported  here  Involved  the  determination  of  the  form  of 
the  performance  variable,  the  definition  and  validation  of 
the  mathematical  model  and  the  analysis  of  results. 
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Ana  1  vs  i  s 

Introduction 

Each  performance  variable,  color  treatment  and 
character  I st  f  c  comb  1 nat Ion's  data  were  ana  1 yzed .  This  was 
necessary  to  test  the  hypotheses  stated  in  Table  4.1. 
Based  on  the  conclusions  derived  from  the  hypotheses 
testing,  inferences  could  be  made  on  how  using  the 
attribute  of  color  in  graphics  displays  affects  the 
performance  of  a  decision  maker  with  an  academic 
engineering  background. 

This  section  explains  how  the  data  were  prepared  for 
input,  what  software  supporting  the  data  manipulation  was 
selected,  how  the  data  were  analyzed  and  what  results  and 
conclusions  were  derived  from  analysis  of  the  above  stated 
hypotheses. 

Data  Preparation 

As  detailed  in  the  methodology  chapter,  the  raw  data 
were  contained  In  the  case  study  packages  used  as  the 
instrument  during  experimentation.  When  the  experiment 
was  completed,  all  of  the  case  studies  were  scored  in  one 
sitting  by  the  researcher.  This  helped  control  for 
consistency  in  scoring  across  all  studies.  As  noted 
earlier,  the  questions  were  deterministic  in  nature  and 
the  correct  response  was  objective  and  one  of  the 


available  multiple  choice  options. 


After  all  120  case 


studies  were  scored,  a  random  sample  of  25  were  selected 
for  rescore  to  confirm  consistency.  No  Inconsistencies 
were  cited. 

The  individual  scores  in  each  case  study  were  coded 
into  a  software  file  to  be  used  for  the  data  manipulation. 
A  hardcopy  was  printed  and  compared  for  errors  against  the 
original  case  studies'  scores,  times  and  volunteer 
characteristics.  After  transcription  corrections  were 
made,  the  researcher  was  confident  the  data  used  in  the 
analyses  were  accurate  and  valid. 

Stat i st i ca 1  Sof tware 

Because  of  the  volume  of  data  and  the  breadth  of  the 
analysis,  a  mathematical  software  package  was  sought  to 
aid  in  the  manipulation.  Two  packages  were  found  that 
were  accessible  to  the  experimenter  and  supported  the 
needed  techniques.  These  were  the  Statistical  Analysis 
System  (SAS)  and  the  Biomedical  Computer  Programs  P-Series 
(BMDP).  Not  only  did  they  have  the  necessary  analytical 
techniques,  but  they  also  had  the  option  of  interaction 
with  each  other.  This  allowed  the  output  from  one 
package's  technique  to  be  used  as  input  to  the  other's 
procedures  which  at  times  was  needed.  The  main  programs 
were  written  in  SAS  which  had  the  option  of  calling  BMDP 
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subprograms.  Both  packages  were  accessed  via  interactive 
keyboard/CRT  Interface  on  the  IBM  WYLBUR  mainframe  located 
i  n  the  ASU  Comput i ng  Serv 1 ces . 

Analysis  of  Properties  of  Data 


I ntroduct ion.  Based  on  the  literature  review  and  the 
hypotheses  under  consideration  the  ANOVA  appeared  to  be 
the  analytic  technique  that  would  generate  sufficient 
statistics  which  would  allow  inferences  to  be  made 
concerning  the  effect  color  has  on  the  performance  of  a 
decision  maker.  However  the  mathematical  literature  warns 
on  using  this  model  without  first  ascertaining  if  the  data 
support  it  (Box,  Hunter  and  Hunter,  1978,  Neter  and 
Wasserman,  1973).  The  statistics  the  model  provides 
maintain  their  sufficiency  only  when  the  data  in  the  model 
meet  certain  conditions.  These  conditions,  often  called 
assumptions,  are  Independence  of  the  error  term,  normality 
of  the  residuals,  homoscedast icity  and  additivity  of  the 
factors. 

An  analytic  algorithm  was  developed  for  the  analysis 
portion  of  this  research  based  on  the  recommendations  of 
the  mathematical  literature.  This  approach  considered  not 
only  the  characteristics  of  the  hypotheses  under 
investigation  but  also  the  properties  of  the  data  used  for 
the  analysis.  The  algorithm  is  shown  in  Figure  4.1, 


Analysis  Strategy.  Initially  a  measure  of  performance  was 
determined  (l.e.  Time  vs  Score)  based  on  correlation 


|  analysis  of  the  time  and  score  data.  Using  this  measure 

as  the  dependent  variable,  a  specific  tentative  ANOVA 
model  was  formulated.  The  four  underlying  criteria  of 

! 

i  independence,  normality,  homoscedast icity  and  additivity 

were  then  examined  separately  to  validate  the  use  of  the 
model.  Based  on  the  results  of  the  analysis  of  the 

•  assumptions,  the  appropriate  model  and  statistics  were 

chosen  for  analysis  of  the  affects  of  color  on 

performance.  Results  and  conclusions  were  generated  based 

I  on  the  ana  lysis  of  the  stat i st i c  on  how  us i ng  the 

attribute  of  color  in  graphics  displays  affects  a  decision 
maker  with  an  academic  engineering  background. 

*  The  data  analysis  portion  of  this  chapter  follows  the 

outline  given  in  Figure  4.1.  Under  each  heading  the 

criterion  Is  defined  and  examined  for  the  effect  on  the 

!  aptness  and  alternatives  to  the  model  if  the  data  depart 

from  it.  Next  the  data  for  each  research  hypothesis  are 
analyzed  with  all  results  shown  in  an  accompanying  table. 

1  Conclusions  are  offered  for  each  criterion. 

Performance  Measure  Selection.  Initial  1 y  the  form  of 


the  performance  measurement  (dependent  variable)  had  to  be 
determined.  If  evidence  existed  to  reject  the  null 


.26 


Figure  4.1  ANALYSIS  STRATEGY 
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hypothesis  that  time  and  score  were  uncorrelated,  the  two 
were  combined  as  score/time  for  the  remaining  analysis. 
If  there  was  insufficient  evidence  to  reject  the 

hypothesis  that  the  correlation  between  time  and  score  was 
zero,  then  each  was  considered  as  a  separate  measure  of 
performance  throughout  the  remaining  analysis. 

Correlation  analysis  was  done  between  the  time  and 
score  performance  variables  to  determine  the  form 

appropriate  for  the  remaining  analysis.  Since  both  the 
time  and  score  performance  were  measured  at  the  interval 
level,  the  Pearson  Product  Moment  Correlation  technique 
was  used  to  generate  the  correlation  coefficient  statistic 
r  (Conover,  1971).  The  hypothesis  Ho:^>=0  was  tested. 
The  results  are  shown  in  Table  4.2j  Correlation  Analysis: 
Score  vs  Time. 

In  Table  4.2,  the  times  recorded  and  available  for 
analysis  head  the  columns.  Total  time  was  the  overall 
time  In  seconds  used  to  complete  those  parts  of  the  case 
study  where  graphics  displays  and  scenario  text  were 
accessible.  No  Memory  Time  was  the  time  lapsed  to  finish 
the  first  scenario,  in  which  recall  was  not  needed  for  the 
decision  making.  Memory  Time  was  the  time  passed  when 
completing  Scenarios  2  and  3.  These  two  scenarios 
contained  a  mix  of  question  types,  many  requiring  memory 
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Table  4.2 

CORRELATIONS  ANALYSIS:  SCORE  vs  TIME 
Ho:  p  =0 
p  Values 

Reject  Hypothesis  If  pi.05 
Sample  Size  N=120 


'■'^TIME 

Total  Time 

No  Memory  Time 

Memory  Time 

QUALITY^\ 

Total 

.81 

.37 

.49 

Sum 

.87 

.07 

.  16 

Scene  1 

.46 

.52 

Scene  2&3 

.40 

.41 

Post  Test 

.78 

.08 

.26 

D  Read  1 

.67 

0  Read  > 1 

.81 

Extrap  1 

.28 

Extrap  >1 

.37 

Short  Mem 

.77 

.92 

Long  Mem 

.81 

.39 

.25 

Long  Mem 
Extrap 

.60 

.10 

.50 

*  Rejected  hypothesis  at  pi  .05 


of  previous  Information  or  decisions.  The  post  test  was 
not  timed. 

The  quality  of  decision  variables  (scores)  are  listed 


in  the 

first 

co 1 umn 

of 

rows .  Each 

1  s 

operat i ona 1 1 y 

defined 

below 

when 

the 

results  of 

the 

analysis  are 

discussed. 

The  entries  in  the  table  are  the  computed  levels, 
often  referred  to  as  the  p  values.  The  null  hypothesis 
tested  was  the  correlation  coefficient  is  equal  to  zero 
or  that  there  is  no  correlation  between  the  time  variable 
at  the  top  of  the  column  and  the  score  variable  at  the 
beginning  of  the  row.  If  the  computed  oL  level  or  p  value 
was  greater  than  an  appropriate  significance  level,  the 
conclusion  was  that  insufficient  evidence  existed  to 
reject  the  hypothesis  of  zero  correlatior  .  Entries  are 
presented  only  for  those  time/ score  combinations  which  had 
relevance. 

Total  Score  of  the  entire  case  study  did  not  appear 
correlated  with  the  overall  time  it  took  to  complete  it. 
There  is  an  .81  probability  of  being  wrong  if  the  decision 
of  correlation  is  made.  Nor  was  there  evidence  to  suggest 
total  score  was  correlated  to  either  no  memory  time 
(p=.37)  or  no  memory  time  (p=.49). 

Sum  Score,  the  sum  of  the  scores  obtained  when  the 


decisions  required  long  term  memory  and/or  extrapolation 
of  data.  There  was  no  evidence  to  suggest  the  time  spent 
using  the  graphics  displays  whether  as  a  whole  (p=.78)  or 
broken  down  (p=.08  and  p*. 26)  consistently  affected  the 
quality  of  decisions  made  during  the  post  interview. 

Direct  Read  1  graph  score  was  the  sum  of  scores  of 
all  decisions  requiring  reading  from  only  one  graph. 
Memory  or  computation  of  data  was  not  required.  The 
questions  included  were  1-2,  i-3,  1-6,  1-9  and  2-4.3. 

These  were  the  simplest  decisions.  The  Direct  Read  score 
did  not  appear  correlated  to  the  total  time  (p=.67). 

Direct  Read  >1  graph  score  was  the  sum  of  scores  of 
all  decisions  requiring  reading  from  more  than  one  graph 
which  did  not  require  memory  or  manipulation  of  data. 
Questions  1-1,  1-4,  2-1,  3-1,  3-2  and  3-7  defined  this 

variable.  Based  on  a  p=.81,  it  was  assumed  there  was  not 
a  significant  linear  correlation  between  this  score  and 
Total  Time. 

Extrapolation  1  graph  score  was  the  composite  score 
of  decisions  which  required  the  decision  makers  to  extract 
information  from  the  text  and/or  one  graphics  display,  and 
manipulate  it  to  arrive  at  the  solution.  Memory  was  not 
required.  The  questions  were  spread  across  the  first 
three  scenarios  to  Include  1-5,  2-3,  3-4  and  3-5.  There 


was  not  evidence  to  suggest  a  correlation  between  this 

measure  of  quality  of  decision  and  Total  Time  (p=.28). 

Extrapolation  >1  graph  score  was  defined  the  same  as 
the  above  score  except  the  decision  was  based  on 
information  given  in  two  or  more  graphics  displays. 

Questions  1-5,  1-7  and  3-6  composed  the  score.  The  data 

did  not  give  support  (p=.37)  to  correlation  between  this 
quality  score  and  Total  Time. 

Short  Term  Memory  score  was  the  sum  of  points  from 

decisions  in  the  interactive  scenarios  (Scenario  2,  3  and 
3)  which  required  memory  of  previously  presented 

information  and/or  decisions.  These  questions  included 
2-2,  2-4.1,  2-4.2,  2-4.4,  2-4.5  and  3-3.  Computation  was 

not  necessary  in  these  decisions.  Based  on  the  p  values 
of  .77  and  .92,  it  was  assumed  a  linear  relationship  did 
not  exist  between  Short  Term  Memory  score  and  either  Total 
or  Memory  Time  respectively. 

Long  Term  Memory  score  was  a  measure  of  qual ity  of 
decision  made  during  the  post  test.  It  is  divided  into 
two  categories.  The  first.  Long  Term  Memory,  pooled  the 
points  of  the  post  test  questions  which  required  only 
memory  and  no  further  calculation  or  manipulation  of  data. 
These  questions  were  4-1,  4-2,  4-7,  4-8,  4-9  and  4-10. 

The  second  category.  Long  Term  Memory  Extrapolation, 


Included  those  post  test  decisions  which  needed  further 


data  or  Information  manipulation  for  solutions.  These 
questions  included  4-3,  4-4,  4-5,  4-6  and  4-11.  Based  on 
the  derived  p  values,  neither  of  these  scores  appeared 
correlated  with  any  of  the  time  categories. 

The  overall  results  indicate  there  was  insufficient 
evidence  to  conclude  that  time  and  quality  of  decision 
were  linearly  correlated.  Therefore  It  is  assumed  for 
this  research  the  covariance  is  zero  between  the  time  and 
any  qual ity  score  measure  of  performance.  Thus  each  were 
analyzed  separately. 

Def i ne  Mode 1 .  A  majority  of  the  treatments  and 

characteristic  factors  were  classified  qualitatively. 
Performance,  a  function  of  time  or  decision  score,  was  a 
quantitative  variable  measured  at  least  on  the  interval 
scale.  The  statistical  literature  suggests  the  2-factor 
fixed  effects  linear  ANOVA  approach.  It  allows  comparison 
of  multiple  means  based  on  several  factors  simultaneously 
without  loss  of  power  (Montgomery,  1976)  and  yields 
sufficient  statistics  if  all  of  the  underlying  assumptions 


are  satisfied  (Box  et  al . , 


1978).  The  ANOVA  chosen  was 


y(ikl)=u  +  /7'(i)  +>d(k)  +  *70  ( i  k )  +  £  ( i  k  1 )  i  =T  reatment  Leve  1 

k=Character i st i c 
Level 

1 =Observat 1 on 
Number 

where  y ( I k 1 )  was  the  (Ikl)th  performance  (time  or  decision 
quality  score)  observation  and  u  was  the  overall  mean 
performance  effect.  The  7*0)  represented  the  true  effect 
of  the  ith  level  of  color  treatment  (B/W,  CRT,  OUT,  RND  or 
PAS).  The  jS  (k)  was  the  true  effect  of  the  kth  level  of 
the  character i st i c  factor  (age,  sex,  college  experience, 
education  level,  CRT  attitude  and  experience  and  video 
game  experience).  (70)  (Ik)  represented  the  effect  of  the 
interaction  between  the  ith  color  treatment  and  the  kth 
characteristic  factor  and  ^(ikl)  was  the  random  error 
component  of  the  (ikl)th  observation.  Table  4.3f 
Variables  for  Analysis  lists  the  variables,  levels  and 
sample  sizes  for  each  term  defined  in  the  models  used  for 
analyses.  The  treatment  factor  is  balanced  across  all 
levels  at  24  sample  points.  This  was  controlled  by  the 
experimenter.  The  sample  size  in  the  levels  of  al I  the 

characteristic  factors  was  a  function  of  the  random 
sample.  The  performance  variables  are  listed  with  sample 
size  of  120  points  for  each  of  them. 

A  fixed  effects  model  was  directed  by  the  problem 


Table  4.3 


VARIABLES  for  ANALYSIS 
Treatment  Factor  (7* I)  Sample  Size 

Color 


CRT  Color  (CRT) 

24 

Black  &  White  (B/W) 

24 

Outline  (OUT) 

24 

Random  ( RND ) 

24 

Pastel  (PAS) 

24 

Characteristic  Factor  (/3k) 

Sample  ! 

Age  (years) 

17-22 

41 

23-28 

52 

29+ 

27 

Sex 

Femal  e 

33 

Male 

87 

College  Experience  (years) 

<3 

37 

>3  and  <5 

30 

15 

53 

Education  Level 

<2  years 

25 

>2  yrs  and  not  Graduate  status 

66 

Graduate 

29 

CRT  Experience 

Not  a  User  (0-3  Never  Use) 

33 

Moderate  User  (0-6  hrs/wk) 

44 

Frequent  User  06  hrs/wk) 

43 

Attitude  toward  Computer  Use 

Uncomfortable 

31 

Comfortabl e 

48 

Very  Comfortable 

41 

Video  Game  Experience 

Do  Not  or  Seldom  Play  (0-3  hrs/wk)  39 

Moderate  Player  (>3  hrs/wk)  81 


Table  4.3  (Cont) 


VARIABLES  for  ANALYSIS 


Performance  Variables  y{lkl) 

Sample  ' 

Quality  Score  (Percentage  Points) 

Total  Score 

120 

Sum  Score 

120 

Scene  1  Score 

120 

Scene  2/3  Score 

120 

Post  Test  Score 

120 

Direct  Read  1  Graph 

120 

Direct  Read  >1  Graph 

120 

Extrapolat Ion  1  Graph 

120 

Extrapolation  >1  Graph 

120 

Short  Memory 

120 

Long  Memory 

120 

Long  Memory  Extrapolat f on 

120 

Time  (Seconds) 

Total  Time 

120 

No  Memory  Time 

120 

Memory  Time 

120 

statement.  The  objective  was  to  determine  if  the  presence 
of  color  in  a  mixed  graphics  display  affected  the 
performance  of  a  decision  maker.  Each  color  combination 
was  chosen  for  a  particular  reason  as  described  in  the 
earlier  chapter.  Since  each  color  combination  available 
on  a  CRT  display  interface  did  not  have  an  equal 

probability  of  being  selected,  the  model  had  to  be  fixed 
as  opposed  to  randomized.  Inferences  made  from  the 

results  are  constrained  to  the  color  treatments  employed. 
Model  Val idation 

Introduction.  Though  the  ANOVA  model  appeared  to 
yield  information  that  would  allow  insight  into  the 
research  problem,  it  had  to  be  examined  for  its  aptness 
(Box  et  al.,  1978;  Neter  and  Wasserman,  1973).  The 
statistics  this  model  provides  maintain  their  sufficiency 
only  when  the  data  used  in  the  model  meet  certain 
conditions.  These  conditions,  often  called  assumptions, 
are  Independence  of  the  error  term,  normality  of 

residuals,  homoscedasticity  and  additivity  of  factors. 
Each  of  these  conditions  was  examined  in  light  of  how 

departures  affect  the  aptness  of  the  model.  Using  the 

analytic  tools  suggested  in  the  mathematical  literature, 
the  data  were  analyzed  to  determine  the  degree  of 

satisfaction  of  the  criteria.  The  assumptions  are 
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addressed  in  the  order  shown  in  Figure  4.1. 

Independence.  Independence  of  the  error  terms  can 
not  be  relaxed.  The  error  term  Is  considered  independent 
if  the  subjects  in  each  of  the  levels  of  factor  groups  are 
randomly  and  Independently  selected  so  the  outcomes  of  any 
subject  had  no  effect  or  influence  on  any  other's  outcome 
within  or  among  any  of  the  groups.  Proper  randomization 
of  the  experiment  introduces  independence  in  the 
assignment  of  the  treatments  to  subjects.  The  resulting 
experimental  errors  may  then  be  regarded  as  independent. 
Usually  proper  randomization  is  achieved  by  randomly 
assigning  the  order  in  which  the  observations  are  taken 
such  that  every  treatment  had  the  same  chance  of  being 
applied  to  each  eligible  subject  (Kendall  and  Stuart, 
1966;  Montgomery,  1976). 

As  detailed  in  the  Experimental  Procedure  section  of 
the  methodology  chapter,  each  color  treatment  was  randomly 
assigned  so  that  every  level  of  treatment  had  the  same 
probability  of  being  applied  to  each  volunteer  subject. 
Therefore  it  is  assumed  the  random  errors  are  independent 
and  this  criterion  satisfied. 

Normality  of  Residuals.  The  ANOVA  is  relatively 
robust  against  the  criterion  of  residual  normality  for  a 
fixed  effects  model.  The  point  estimates  of  the  factor 
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level  means  are  unbiased  as  long  as  the  distribution  of 
the  residuals  form  a  be II -shaped  curve.  The  ANOVA  uses 
the  F  test  for  the  equality  of  factor  level  means,  which 
is  relatively  insensitive  in  terms  of  power  and 
significance  to  the  lack  of  normality  (Berenson,  Levine 
and  Goldstein,  1983;  Box  et  al.,  1978;  Neter  and 

Wasserman,  1973).  If  there  is  reason  to  believe  that  the 
data  do  not  form  at  least  a  be 11 -shaped  curve,  the 

literature  recommends  applying  an  appropriate 
transformation  of  the  data  to  make  the  distribution  normal 
or  be 11 -shaped. 

Several  criteria  were  used  to  determine  if  there  was 
evidence  to  support  the  rejection  of  the  hypothesis  of 
residual  normality.  The  residuals  for  each  cell  of  each 

ANOVA  model  matrix  under  consideration  were  calculated. 
The  Shapiro-Wilk  test  was  applied  to  test  for  normality. 
If  the  derived  eC  value  In  each  of  the  cells  where  the 
hypothesis  was  tested  was  greater  than  .05,  the  hypothesis 
was  not  rejected,  the  residuals  were  assumed  normal  and 
the  entry  In  Table  4.4,  Normality  of  Residuals,  recorded 
as  N.  For  those  cells  In  a  model  for  which  the  derived*, 
was  less  than  .05,  the  kurtosls  and  skewness  statistics 
were  analyzed  for  bell-shaped  determination.  If  the 
skewness  was  less  than  1.0  and  kurtosls  less  than  2.5,  the 


distribution  was  assumed  normal/bell  denoted  by  N/B  in 
Table  4.4  (Lewis  and  Ford,  1983).  If  the  skewness  was 
less  than  2.0  and  there  were  two  or  fewer  of  the  10  or  15 
cells  that  had  kurtosls  greater  than  2.0  but  less  than 
7.0,  the  distributions  of  the  cell  residuals  were  assumed 
be 1 1 -shaped.  These  were  coded  B  in  the  Table  4.4.  Those 
which  did  not  satisfy  any  of  the  above  were  not  assumed 
either  normal  or  bell -shaped  but  skewed.  These  were  coded 
S  in  Table  4.4. 

Table  4.4  presents  the  results  of  testing  the 

residuals  for  normal ity  or  be 11 -shaped  character i st i cs. 
The  paired  independent  factors  head  the  column  and  the 
performance  measure  considered  starts  the  row.  Using  the 
criteria  stated  above,  the  statistics  from  the 
Shaplro-Wilk  normality  test  were  analyzed.  As  shown  in 
the  table,  there  was  insufficient  evidence  to  reject  the 
assumption  the  residuals  were  distributed  normally  or  at 
least  be  11 -shaped  for  all  performance  measures  except  one. 
The  exception  was  the  model  whose  performance  measure 
defined  quality  of  decision  derived  from  direct  reading  of 
only  one  graphics  display.  Analyzing  these  cell 

distributions  indicated  all  were  highly  skewed  to  the 
left.  This  was  anticipated  since  it  was  the  simplest  of 
decisions  and  all  the  scores  clustered  at  the  maximum 
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cene  1 

cene  2&3 

ost  Test 

Read  I 

Read  > 1 

xtrap  l 

xtrap  > 1 

hort  Mem 

ong  Mem 

ong  Mem 
Extrap 


Table  4.4 

NORMALITY  of  RESIDUALS 
Ho:  Residual  Are  Normal  or  Bell -Shaped 


Color 

& 

Age 

Color 

& 

Sex 

Color 

& 

Col  Ex 

Color  , 
& 

Edu  Lv 

Color 

& 

CRT  Ex 

Color 

& 

Attlt 

Color 

& 

VGame 

B 

N 

N/B 

B 

N 

N/B 

N 

N 

N 

B 

N 

B 

N 

N/B 

B 

N/B 

B 

N/B 

B 

B 

B 

N 

N/B 

N 

N 

N 

B 

B 

B 

N/B 

B 

N/B 

N 

B 

B 

S 

S 

S 

S 

S 

S 

S 

B 

N/B 

N/B 

N/B 

N/B 

B 

B 

B 

N/B 

N 

B 

N/B 

N/B 

N/B 

B 

N/B 

B 

B 

N/B 

B 

B 

B 

B 

B 

B 

N 

B 

N/B 

B 

N/B 

N/B 

1 

B 

B 

B 

N/B 

N/B 

N/B 

N 

B 

N/B 

N 

B 

B 

B 

B 

B 

B 

N/B 

N/B 

B 

B 

N 

N/B 

N 

B 

N/B 

N 

B 

N 

B 

N 

N 

N/B 

N  Norma  1 

N/B  Norma  1 /Be 1 1 -Shaped 
B  Bell -Shaped 
S  Skewed 


score.  Thus  the  model  using  the  direct  read  from  one 
graph  as  the  performance  measure  was  not  considered  for 
further  analysis.  Instead  It  Is  used  to  validate  the 
assumption  the  subjects  interpreted  the  graphics  displays 
correctly.  Therefore,  for  the  remaining  analyses,  it  is 
assumed  the  normality  of  residual  assumption  is  satisfied 
to  the  degree  necessary  for  valid  use  of  the  2  factor 
fixed  effect  linear  model  for  all  hypotheses  under 
scrutiny. 

Homogeneity  of  Variances.  Unless  the  sample  sizes 
are  equal,  departures  from  the  criteria  of 
homoscedast 1 c 1 ty  could  seriously  affect  the  level  and 
power  of  the  F  test  (Berenson  et  al.,  1983;  Box  et  al., 
1978;  Brown  and  Forsythe,  1974;  Montgomery,  1976;  Neter 
and  Wasserman,  1974).  Unequal  variances  invalidates  the 
use  of  the  ANOVA  model  when  sample  sizes  are  unequal 
across  treatments. 

There  are  several  methods  available  to  determine  if 
the  variances  are  equal.  One  such  model  is  the  Modified 
Levene's  L  test  which  is  relatively  robust  to  the 
normality  assumption  (Berenson,  et  al.,  1983)  and  does  not 
require  equal  sample  sizes. 

The  literature  suggests  two  ways  to  handle  the 


problem  of  unequal  variances:  transformation  of  data  or 
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recalculation  of  the  F  statistic  of  the  ANOVA.  A 
transformation  is  a  viable  option  if  the  residual  data  are 
not  distributed  normally  as  It  usually  restores  normality 
as  well  as  stabilizes  the  variances.  However  It  has  been 
found  that  if  a  transformat  Ion  is  applied  to  normal  data 
in  attempts  to  equate  the  variances,  normality  is  almost 
always  destroyed  (Berenson  et  al.,  1983;  Neter  and 
Wasserman,  1974).  If  this  occurs,  the  approximation  by 
Welch  or  Brown-Forsythe  can  be  employed  directly  on  the 
original  data  in  lieu  of  the  usual  ANOVA  to  compare  the 
means  (Berenson  et  al.,  1983;  Brown  and  Forsythe,  1974). 

Welch  and  James  bor;h  extended  the  Welch  asymptotic 
confidence  Interval  approach  for  the  two-means  problem  to 
more  than  two  groups.  Brown  and  Forsythe  (1974)  studied 
the  behavior  of  the  F  statistic  (ANOVA),  W  statistic 
(Welch),  the  J  statistic  (James),  and  F*  statistic 
(Brown-Forsythe  modified  F).  From  their  results  they 
concluded  that  when  the  variances  were  equal,  both  the  W 
and  F*  are  close  in  power  to  the  F.  The  choice  between  W 
and  F*  depends  on  whether  the  extreme  means  are  thought  to 
have  extreme  variances  (W)  or  not  (F*).  When  the 
variances  were  not  equal,  the  F  statistic  showed  marked 
deviation  from  Its  nominal  size  (3-17%).  Therefore  they 


feel 


fjWil 


Considering  the  small  loss  in  power  in 
using  F*  or  W  rather  than  F  even  when  the 
population  variances  are  equal ,  protection 
against  F's  distortion  in  size  recommend  the  use 
of  F#  or  W.  The  asymptotic  approximations  of 
the  critical  values  of  F*  and  W  are  valid  when 
each  has  at  least  10  observations  and  are  not 
unreasonable  for  W  down  to  5  observations  (Brown 
and  Forsythe,  1974,  p.  131). 

Levene's  test  of  equal  variance  was  used  to  test  the 
hypothesis  that  all  variances  in  a  specific  model  were 
equal.  This  test  did  not  require  the  data  to  be  normal  or 
the  sample  sizes  equal.  The  results  shown  for  each  model 
are  presented  in  Table  4.5,  Homogeneity  of  Variances. 

The  matrix  is  the  same  as  used  in  Table  4.4.  The 
Independent  factor  combinations  head  the  columns  and  the 
performance  measures  start  the  rows.  The  table  entries 
are  the  computed  or  p  values.  The  null  hypothesis 
tested  was  the  variances  in  a  specific  model's  cells  were 
a  I  1  equa  I .  If  the  computed  or  p  value  was  greater  than 
the  significance  level,  set  at  .05,  the  data  did  not 
contain  sufficient  evidence  to  reject  the  hypothesis  of 
equal  variances. 

Those  tests  where  there  was  evidence  to  support 
rejecting  the  equal  variance  hypothesis  are  denoted  by  an 


asterisk  (*).  Since  the  data  were  assumed  at  least  bell 


shaped,  falling  this  underlying  assumption  meant  using  the 
Welch  W  or  Brown-Forsythe  F*  statistic  for  further 


HOMOGENEITY  of  VARIANCE 


Ho:  Cell  Variances  Equal 
p  Values 

Reject  Hypothesis  If  p<..05 


Factors 

Performance 

Color 

& 

Age 

Color 

& 

Sex 

Color 

& 

Col  Ex 

Color 

& 

Edu  Lv 

Color 

& 

CRT  Ex 

Color 

& 

Attlt 

QUAL I TY 

Total 

.62 

.13 

.06 

.08 

.80 

.56 

.15 

Sum 

.  17 

.07 

.  1 1 

.07 

.79 

.51 

.15 

Scene  1 

.00* 

.00* 

.  15 

.00* 

.01* 

.  15 

.03* 

Scene  2&3 

.33 

.78 

.31 

.75 

.94 

.35 

.21 

Post  Test 

.33 

.03* 

.06 

.00* 

.03* 

.  13 

.21 

D  Read  > 1 

.25 

.23 

.23 

.40 

.15 

.39 

.73 

Extrap  1 

.38 

.53 

.52 

.68 

.84 

.  15 

.62 

Extrap  > 1 

.95 

.39 

.85 

.  10 

.39 

.45 

.51 

Short  Mem 

.20 

.19 

.68 

.74 

.  19 

.06 

.03* 

Long  Mem 

.04* 

.06 

.01* 

.01* 

.  1  1 

.03* 

.00* 

Long  Mem 

.52 

.21 

.88 

.26 

.36 

.  13 

.93 

Extrap 

TIME 

Total 

.02* 

.56 

.  16 

.56 

.73 

.45 

.15 

No  Memory 

.06 

.50 

.00* 

.15 

.08 

.02* 

.20 

Memory 

.04* 

.61 

.48 

.34 

.  19 

.23 

.69 

*  Rejected  hypothesis  at  p^  .05 


I 


analyses. 


For  those  models  whose  data  did  not  give 


sufficient  evidence  to  reject  the  equality  of  variance, 
further  study  was  done  by  applying  the  two-factor  fixed 
effects  ANOVA  technique  using  the  F  statistic. 

Table  4.6,  ANOVA  Statistic  for  Anal ys i s,  presents  the 
particular  statistic  used  based  on  the  equality  and  size 
of  variance.  If  the  variances  were  assumed  equal  as  shown 
In  Table  4.5,  the  ANOVA  F  statistic  (F)  was  used  for 
further  analysis.  If  there  was  evidence  to  suggest 
hetrogenei ty,  most  cell  sample  sizes  were  at  least  10  and 
the  extreme  means  did  not  have  proportional  variance,  the 
Brown  -  Forsythe  F*  statistic  (F*)  was  employed  for  further 
analysis.  Finally,  if  the  variances  of  a  model  were  not 
considered  homogeneous,  a  majority  of  the  cells  sizes 
numbered  less  than  10  or  the  extreme  means  were  estimated 
to  have  extreme  variances,  the  Welch  W  statistic  (W)  was 
used.  As  a  result.  Table  4.6  displays  the  ANOVA  statistic 
used  to  resolve  the  research  questions.  Of  the  98 
hypotheses,  80  were  examined  using  the  F  statistic,  15 
using  the  Welch  W  statistic  and  3  using  the  Brown-Forsythe 
F*  statistic. 

Additivity  of  Main  Effects.  Factor  effects  are 
considered  additive  If  there  are  no  significant  factor 
Interactions.  Each  factor  can  be  described  separately  by 
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Table  4.6 

ANOVA  STATISTIC  for  ANALYSIS 


^V^actors 

Performance 

Color 

& 

Age 

Color 

& 

Sex 

Color 

& 

Col  Ex 

Color 

& 

Edu  Lv 

Color 

& 

CRT  Ex 

Color 

& 

Attlt 

Color 

& 

VGame 

OUAL I TY 

Total 

F 

F 

F 

F 

F 

F 

F 

Sum 

F 

F 

F 

F 

F 

F 

F 

Scene  1 

W 

F* 

F 

W 

W 

F 

W 

Scene  2&3 

F 

F 

F 

F 

F 

F 

F 

Post  Test 

F 

W 

F 

W 

W 

F 

F 

0  Read  >1 

F 

F 

F 

F 

F 

F 

F 

Extrap  1 

F 

F 

F 

F 

F 

F 

F 

Extrap  > 1 

F 

F 

F 

F 

F 

F 

F 

Short  Mem 

F 

F 

F 

F 

F 

F 

F* 

Long  Mem 

W 

F 

W 

W 

F 

W 

F* 

Long  Mem 

F 

F 

F 

F 

F 

F 

F 

Extrap 

TIME 

Total 

W 

F 

F 

F 

F 

F 

F 

No  Memory 

F 

F 

W 

F 

F 

W 

F 

Memory 

W 

F 

F 

F 

F 

F 

F 

F  ANOVA  F  Statistic 
F*  Brown-Forsythe  Statistic 
W  Welch  Statistic 


analyzing  the  factor  level  means  of  the  factor  main 
effects.  However  if  significant  interaction  exists 
between  the  main  factors,  the  main  effects  statistics  have 
little  practical  meaning.  When  this  is  the  circumstance, 
the  literature  suggests  holding  one  factor  constant  while 
applying  the  ANOVA  on  the  others  in  order  to  draw 
conclusions  on  the  non-constant  main  effects  (Montgomery, 
1976;  Neter  and  Wasserman,  1974). 

Additivity  was  checked  in  all  98  cases  by  testing  the 
hypothesis  that  the  interaction  effect  was  zero.  If  there 
was  insufficient  evidence  to  reject  this  hypothesis,  the 
main  effects  were  considered  additive  and  analyzed  by  the 
suggested  statistic  shown  In  Table  4.6. 

However  If  the  data  indicated  the  Interaction  effect 
was  not  zero,  the  character ist 1c  factor  was  held  constant 
at  each  of  its  levels  and  the  entire  analytical  process 
repeated  starting  at  the  testing  of  residual  normality. 
The  statistics  derived  from  the  repeat  analysis  on  the  new 
subset  of  data  were  used  for  analysis  of  the  main  effects. 
Inferences  based  on  this  data  were  constrained  to  the 
particular  level  of  the  characteristic  factor. 

Table  4.7,  Additivity  Analysis,  presents  the  p  values 
which  were  calculated  when  testing  the  hypothesis  the 
interaction  factor  (*7/9)  Ik  was  zero  or  the  main  effects 
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Table  4.7 

ADDITIVITY  of  MAIN  EFFECTS 
Hos  (Ttf)  ij  =  0 
p  Values 

Reject  Hypothesis  If  p<..05 


^\Factors 

Performance 

Color 

& 

Age 

Color 

& 

Sex 

Color 

& 

Col  Ex 

Color 

& 

Edu  Lv 

Color 

& 

CRT  Ex 

Color 

& 

Attit 

Color 

& 

VGame 

QUALITY 

Total 

.24 

.01* 

.43 

.  17 

.91 

.48 

.27 

Sum 

.37 

.02* 

.52 

.07 

.96 

.32 

.32 

Scene  1 

.65# 

.09# 

.40 

.03** 

.82# 

.07 

.27# 

Scene  2&3 

.24 

.03* 

.72 

.23 

.91 

.63 

.46 

Post  Test 

.43 

.03** 

.34 

.34# 

.61# 

.57 

.53 

D  Read  > 1 

.65 

.04* 

.65 

.21 

.92 

.87 

.82 

Extrap  1 

.86 

.87 

.32 

.04* 

.22 

.35 

.77 

Extrap  > 1 

.44 

.27 

.89 

.49 

.71 

.51 

.06 

Short  Mem 

.06 

.24 

.46 

.46 

.93 

.36 

.29# 

Long  Mem 

.59# 

.61 

.82# 

.39# 

.43 

.06# 

.35# 

Long  Mem 

.56 

.  1  1 

.  19 

.31 

.73 

.62 

.23 

Extrap 

TIME 

Total 

.42# 

.35 

.03* 

.89 

.92 

.47 

.75 

No  Memory 

.98 

.39 

.03** 

.84 

.59 

.41# 

.77 

Memory 

.16# 

.31 

.20 

.84 

.92 

.43 

.70 

*  Rejected  Hypothesis  based  on  F  Statistic  at  p^.05 

**  Rejected  Hypothesis  based  on  Inflated  F  Statistic  at 
p£.  05 

#  p  Value  using  (1.25XANOVA  F) 
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were  additive.  The  entries  in  Table  4.7  were  calculated 
in  two  ways,  depending  on  the  method  used  for  analysis: 
ANOVA  F  or  Welch  and  Brown-Forsythe  (see  Table  4.6:  ANOVA 
Statistic  for  Analysis).  If  the  ANOVA  F  technique  was 
applied,  the  F  statistic  reported  for  the  interaction  term 
was  used  to  determine  the  tail  probability  or  p  value. 
They  are  indicated  by  a  plain  or  *  entry  in  Table  4.7. 
The  *  indicates  the  data  gave  evidence  to  reject 
additivity  of  the  main  factors  at  the  .05  significance 
level.  For  these  cases,  the  characteri st ic  factor  was 
held  constant  at  the  different  levels  and  the  entire 
analysis  procedure  repeated,  beginning  at  the  step  of 
testing  for  normality  of  residuals.  These  cases  Included 
Color  x  Sex  vs  Total  Score,  Color  x  Sex  vs  Sum  Score, 
Color  x  Sex  vs  Scenario  2  &  3  Score,  Color  x  Sex  vs  Direct 
Read  >1  Graph  Score,  Color  x  College  Experience  vs  Total 
Time,  and  Color  x  Education  Level  vs  Extrapolation  1  Graph 
Score. 

For  those  cases  which  earlier  analyses  indicated 
Welch  W  or  Brown-Forsythe  F*  should  be  applied,  the 
entries  in  Table  4.7  were  calculated  by  inflating  the 
ANOVA  F.  The  available  Welch  method  only  tested  main 


effects.  Brown  and  Forsythe  (1974)  reported  their 


analyses  showed  the  ANOVA  F  statistic  fluctuated  as  much 
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as  17% 

when 

the 

sma  1  1 

samples  had 

1  arge 

var i ances . 

To 

account 

for 

a  possible 

fluctuation 

that 

may  affect 

the 

testing 

of 

the 

additivity  hypothesis. 

the  ANOVA 

F 

statistic  was  Inflated  by  25%  before  determining  the  p 
value.  — ■  If  the  data  contained  evidence  that  rejected 
additivity,  the  inflated  F  would  be  more  sensitive  in 
detecting  It.  These  entries  are  noted  in  the  table  by  # 
or  **  symbols.  The  #  indicated  there  was  no  evidence  to 
reject  additivity,  therefore  the  Welch  W  or  Brown-Forsythe 
F*  statistic  was  used  to  analyze  the  main  effect  of  color. 
For  those  with  **,  Interaction  may  exist.  For  these 
models,  the  characteristic  factor  was  held  constant  at  the 
different  levels,  and  the  entire  analysis  procedure 
repeated  starting  at  testing  the  residuals  for  normality. 
Three  cases  fell  Into  this  category:  Color  x  Sex  vs  Post 
Test  Score,  Color  x  College  Experience  vs  No  Memory  Time, 
and  Color  x  Education  Level  vs  Scenario  1  Quality  Score. 
Analysis  of  Color  Effects 

Preliminary  Color  Effects  Analysis.  After  the 
assumption  of  additivity  of  main  factors  was  satisfied, 
the  statistics  derived  from  the  ANOVA  model  became 
sufficient  estimators  of  the  parameters  of  primary 
interest.  They  were  analyzed  to  determine  If  the  average 
performance  of  the  decision  maker  with  an  academic 
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engineering  background  was  affected  differently  by  one  or 
more  of  the  color  treatments.  Only  the  statistics 
estimating  the  color  treatment  parameter  ('T'D  were  of 
concern  to  this  research.  If  the  ANOVA  F,  F*  or  W 
Indicated  no  significant  difference  among  the  treatment 
means  at  the  .05  significance  level,  the  result  was  to 
fall  to  reject  the  null  hypothesis  under  consideration. 
For  that  particular  performance  variable  for  which  the 
variability  due  to  the  characteristic  factor  Gdk)  was 
accounted,  the  results  supported  the  conclusion  that  there 
was  no  difference  In  performance  attributable  to  the  color 
of  graphics  display  used. 

For  the  three  cases  where  the  variances  were  not 
equal  and  additivity  could  not  be  assumed  (Sex  vs  Post 
Test,  Col  Exp  vs  No  Memory,  Edu  Lev  vs  Scene  1),  the 
method  of  analysis  was  reevaluated  during  the  repeat  of 
the  analysis  procedure.  These  hypotheses  contain  more 
than  one  p  value/entry  In  Table  4.8,  Analysis  of  Main 
Effects  of  Color.  These  p  values  correspond  to  those 
calculated  for  each  level  of  the  characteristic  factor 
under  consideration  and  are  reported  In  the  same  order  as 
the  levels  are  listed  in  Table  4.3,  Variables  for 
Analysis.  That  Is  for  Sex,  the  pairs  contain  the  p  values 


for  female  and  male  respectively.  The  College  Experience 
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Table  4.8 

ANALYSIS  of  MAIN  EFFECT  of  COLOR 
Ho:  T\  =  0 


p  Values 

Reject  Hypothesis  If  p<.05 


^s^Factors 

Performance 

Color 

& 

Age 

Color 

& 

Sex 

Color 

& 

Col  Ex 

Color 

& 

Edu  Lv 

Color 

& 

CRT  Ex 

Color 

& 

Attlt 

Color 

& 

VGame 

OUALITY 

Total 

.  19 

.01* 

.12 

.22 

.26 

.15 

.  13 

.03* 

Sum 

.22 

.02* 

.07 

.  12 

.38 

.15 

.  17 

.05* 

Scene  1 

.24 

.  13 

.  13 

.  1  1 
.06 
.61 

.43 

.21 

.01* 

Scene  2&3 

.32 

.04* 

.  17 

.36 

.55 

.32 

.36 

.  1  1 

Post  Test 

.  19 

.  15 

.04* 

.38 

.10 

.05* 

D  Read  > 1 

.53 

.38 

.92 

.26 

.25 

.22 

Extrap  1 

.99 

.72 

.74 

.24 

.05* 

.26 

.45 

.47 

.41 

Extrap  >1 

.64 

.28 

.23 

.  18 

.  1  1 

.21 

.05* 

Short  Mem 

.20 

.27 

.54 

.48 

.73 

.88 

.48 

Long  Mem 

.28 

.08 

.67 

.24 

.04* 

.22 

.39 

Long  Mem 
Extrap 

.57 

.23 

.41 

.57 

.35 

.27 

.  1  1 

1U3L 

Total 

.25 

.84 

.79 

.03* 

.50 

.71 

.55 

.32 

.70 

No  Memory 

.86 

.94 

.75 

.06 

.81 

.87 

.86 

.58 

.94 

Memory 

.39 

.24 

.  15 

.39 

.40 

.  12 

.44 

*  Rejected  hypothesis  at  p£  .05 
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triple  reports  the  p  value  for  Under  3  Years,  3  and  less 
than  5  Years  and  5  Years  or  more.  Less  than  2  Years,  2  or 
more  Years  and  Undergraduate  Status ,  and  Graduate  1 s  the 
order  for  the  p  values  for  Education  Level.  These  p 
values  were  based  on  the  statistics  derived  during  the 
analytic  reevaluation  of  the  revised  data  set. 

When  only  the  post  test  performance  of  males  was 
investigated,  the  data  supported  using  the  ANOVA  F. 
However,  the  variances  were  not  equal  among  the  female 
groups  of  color  treatments.  The  sample  size  in  each  color 
treatment  mandated  the  W  statistic.  When  the  Color  x 

College  Experience  vs  No  Memory  Time  was  analyzed  one 
experience  level  at  a  time,  analysis  indicated  Welch  W  be 
used  for  two  of  its  levels:  less  than  3  years  and  more 
than  5  years  college  experience.  The  ANOVA  F  was 

sufficient  for  the  middle  level  of  at  least  3  years  but 
less  than  5  years  college  experience.  After  examining  the 

data  for  the  three  levels  of  the  Education  Level  x  Color 

vs  Scenario  1  Quality  Score,  unequal  variances  and  sample 
size  suggested  Welch  W  for  the  less  than  2  years  level, 
Brown-Forsythe  F*  for  the  at  least  2  years  and 

Undergraduate.  The  data  of  the  graduates  satisfied  the 
ANOVA  F  criteria,  thus  it  was  used. 

In-depth  Analysis.  If  the  ANOVA  statistic  indicated 


a  significant  difference  (p^.05),  further  in-depth 
analysis  was  done  in  efforts  to  discriminate  among  the 
means.  This  was  accomplished  by  applying  one  of  three 
methods:  Duncan  Multiple  Range  Test,  Tukey-Kramer  Test  or 
95%  Confidence  Interval  Graphical  Inspection.  If  the 
original  model  under  consideration  assumed  equality  of 
variance  and  additivity,  the  Duncan  Multiple  Range  Test 
was  applied  to  the  data.  This  test  is  reported  to  be  one 
of  the  most  powerful  (Box  et  al . ,  1978;  Montgomery,  1976). 
If  the  variances  were  assumed  homogeneous  but  additivity 
was  rejected  and  further  analysis  was  done  using  the  data 
from  a  particular  level  of  a  characteristic  factor,  the 
Tukey-Kramer  Test  was  applied.  This  test  is  a 
modification  of  the  Tukey  Test,  and  unlike  both  the  Duncan 
and  the  Tukey  tests,  ft  is  robust  against  unequal  cell 
size.  Since  the  character i st i c  factors  were  not 
controlled  for  balance,  when  additivity  was  rejected,  the 
cell  sizes  were  guaranteed  to  be  unequal.  If  the  model 
did  not  satisfy  homoscedast 1 c i ty,  the  data  were  analyzed 
by  building  95%  confidence  intervals  about  all  the  means. 
These  confidence  intervals  were  plotted  and  a  subjective 
analysis  was  graphically  conducted.  The  results  from 
Duncan  and  Tukey-Kramer  are  also  given  as  estimates  of  the 
groupings  to  support  the  graphical  analysis.  Each  of  the 
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hypotheses  that  was  rejected  at  the  .05  significance  level 
was  analyzed  using  one  of  these  three  methods  and  are 
discussed  separately. 

Prior  to  presenting  the  final  results,  one  model  was 
selected  to  follow  through  the  entire  analysis  process. 
The  model  defined,  validated  and  analyzed  using  the 
developed  analyses  strategy  shown  in  Figure  4.1  and 

detailed  above.  The  relevant  portions  of  the  analysis 
strategy  diagram  are  filled  In  with  the  appropriate 
statistics  and  p  values  derived  from  the  models  data. 
This  is  shown  in  Figure  4.2,  Analysis  of  Color  x  Age  vs 
Scene  1&2  Quality  Score.  The  analysis  is  discussed  next 
1 n  deta 1 1 . 

The  example  model  examined  is  Color  x  Age  vs  Scene 
2&3  quality  score.  Independence  was  assumed  based  on  the 
procedures  employed  in  randomizing  the  color  treatments 
among  the  three  age  groups  under  investigation. 

The  normal ity  of  residuals  was  tested  by  examining 
the  p  values  derived  from  applying  the  Wilk-Shapiro  test 
on  the  residual  data  In  each  cell  of  the  2-factor  fixed 
effects  model.  These  are  shown  below  in  Figure  4.3. 


L5‘ 


Figure  4.2  ANALYSIS  of  COLOR  &  AGE  vs  SCENE  1&2  QUALITY 


L  58 


6 

i 


s 

p 


Figure  4.3  p  Values  Derived  from  Wilk-Shapiro 
As  seen  in  the  table,  all  the  p  values  were  greater  than 
the  .05  significance  level.  There  was  insufficient 
evidence  to  reject  the  hypotheses  that  the  residuals  in 
each  cell  of  the  ANOVA  model  were  normally  distributed. 
Therefore  a  transformation  was  not  needed. 

Hetroscedasticity  was  tested  next.  The  p  value 
resulting  from  applying  the  Modified  Levene's  test  was 
compared  to  the  .05  significance  level.  Levene's  L=1.14 
with  14,105  degrees  of  freedom  corresponded  to  p=.33.  The 
data  contained  insufficient  evidence  to  reject  the 
hypothesis  that  the  variances  were  homogeneous  across  all 
cells  of  the  defined  model.  Equal  variances  implied  the 
derived  ANOVA  F  statistics  were  sufficient  for  testing  the 
next  criterion  of  additivity  of  factors. 

Additivity  of  main  factors  was  tested  by  examining 
the  F  and  its  associated  p  value  for  the  interaction  term 
of  age  and  color.  The  2- factor  ANOVA  method  was  applied 
via  the  SAS  program  which  yielded  F=1.32  with  8,  105 


degrees  of  freedom.  The  associated  p  value  was  .24.  The 
data  contained  Insufficient  evidence  to  reject  the 
hypothesis  the  main  factors  of  color  and  age  were 
additive.  Based  on  this  result  the  ANOVA  F  was  validated 
as  a  sufficient  statistic  for  analyses  of  the  hypothesis 
under  scrutiny. 

After  the  variability  in  performance  attributable  to 
age  was  accounted,  did  color  affect  the  Scene  1&2  decision 
quality?  This  question  was  answered  by  analyzing  the 
ANOVA  F  for  the  main  factor  of  color  derived  from  applying 
the  ANOVA  on  the  2  -  factor  fixed  effect  model.  The 
resulting  F  was  1.20  with  4,  105  degrees  of  freedom  which 
corresponded  to  a  p  value  of  .32.  Since  this  p  value  was 
greater  than  .05,  the  significance  level,  the  conclusion 
was  made  the  color  factor  was  not  significant.  If  it  had 
been,  the  data  would  have  been  further  analyzed  using 
Duncan's  Multiple  Range  as  the  variances  were  assumed 
equal  as  well  as  main  factors  of  color  and  age  additive. 

This  same  procedure  was  repeated  for  all  of  the 
formalized  hypothesis  listed  in  Table  4.1.  The  results  of 
these  analyses  are  delineated  next. 

Resu 1 ts 

Introduction 

The  overall  objective  of  this  research  was  to  examine 
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how  the  attribute  of  color,  when  used  functionally  in 
computer  generated  mixed  graphics  displays,  affects  the 
decision  maker  who  has  an  academic  engineering  background. 
One  hundred  eleven  models  were  analyzed  to  gain  insight 
into  the  research  question.  The  results  of  testing  the 
null  hypothesis  that  performance  as  measured  by  quality  or 
time  of  decision  was  not  affected  by  the  color  of  the 
mixed  graphics  display  are  shown  in  Table  4.8,  Analysis 
of  Main  Effects  of  Color.  Each  entry  in  this  table  is  the 
computed  or  p  value  derived  from  the  appropriate  ANOVA 
statistic  as  discussed  previously. 

Color  Factor  Insignificant 

Ninety-eight  of  the  111  tested  hypotheses  of  equal 
performance  among  color  treatments  could  not  be  rejected 
at  the  .05  significance  level.  For  these  research 
hypotheses,  which  comprised  a  majority  of  al 1  the  cases, 
there  was  insufficient  evidence  to  conclude  any  difference 
in  the  decision  maker's  performance  that  could  be 
attributed  to  the  particular  color  (B/W,  CRT,  RND,  OUT  or 
PAS)  of  the  mixed  graphics  Interface. 

Color  Factor  Significant 

These  models  which  preliminary  analysis  of  the  data 
Indicated  sufficient  evidence  that  treatment 
differential ly  affected  performance  correspond  to  those 
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whose  entries  have  an  asterisk  (*)  In  Table  4.8.  These 
are  listed  separately  In  Table  4.9,  Hypotheses  Rejected 
at  =  .05.  There  are  13  such  cases:  females  (4),  those 
who  have  between  3  and  less  than  5  years  college 
experience  (1),  education  level  (2),  CRT  experience  (1) 
and  video  game  experience  (5).  Twelve  of  these  had 
performance  measured  by  quality  of  decision  and  one  by 
decision  time.  These  13  are  analyzed  In-depth  next  using 
Duncan,  Tukey-Kramer  or  the  95%  Confidence  Interval 
Graphical  approach.  Tables  4.10-4.14,  constructed  by 
characteristic  factor,  delineate  the  results. 

Sex.  The  data  did  not  indicate  color  affected  the 
performance,  whether  it  was  measured  as  quality  of 
decision  or  time  when  the  subjects  were  male.  However, 
the  data  suggested  females  were  affected  by  color  with 
respect  to  quality  of  decision.  Table  4.10,  Analysis  of 
Female  x  Treatment,  outlines  the  results.  Since 
homogeneity  of  variance  was  satisfied  but  the  sample 
size/cell  was  not  equal,  Tukey-Kramer  was  used  to  detect 
the  differences.  Across  the  top  of  the  columns  are  listed 
the  color  treatments  In  rank  order,  the  best  performance 
to  the  worst  performance.  Down  the  first  column  are  the 
performance  variables  which  earlier  analyses  suggested 
were  affected  by  the  color  treatment.  The  entries  In  the 


^Factors 

Performance 

Color 

& 

Sex 

(Female) 

Color 

& 

Col  Ex 
(3<5  Yr) 

Color 

& 

Edu  Lv 

Color 

& 

CRT  Ex 

Color 

& 

VGame 

OUAL I TY 

Total 

.01 

.03 

Sum 

.02 

in 

o 

• 

Scene  1 

.01 

Scene  2&3 

.04 

Post  Test 

.04 

.04 

.05 

Extrap  1 

12  &  UnG 
.05 

Extrap  > 1 

.05 

Long  Mem 

.04 

TIME 

Total 

.03 
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Table  4.10 

ANALYSIS  of  FEMALE  vs  COLOR  TREATMENT 

Performance  Not  Equal 
Across 

A1 1  Treatments 
At  p<.05 

Variances  Equal 
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table  are  the  mean  scores  for  each  color  treatment  vs 
performance  variable.  The  rank  order  for  the  treatments 
and  Tukey  Kramer  groupings  were  consistent  for  the  three 
cases  measuring  Total  Quality  Score,  Sum  (Interactive) 
Quality  Score  and  Scenario  2  S3  Quality  Score.  These 
three  performance  measures  were  highly  correlated  so  the 
results  were  expected  to  be  consistent.  The  Tukey-Kramer 
analysis  («(.  =  . 05)  grouped  (CRT  B/W  PAS  RND)  as  coded  by  A 
in  Table  4.10  and  (B/W  PAS  RND  OUT)  as  coded  by  B.  The 
CRT  color  treatment  ranked  the  best  with  B/W  the  second 
best  though  analysis  did  not  Indicate  a  significant 

difference  between  these  two.  The  CRT  treatment  yielded  a 

16.5  per  cent  Increase  In  Total  Score  over  the  B/W 
treatment,  a  11.6  per  cent  Increase  in  Sum  Score  and  a 

14.5  per  cent  Increase  In  Scene  2&3  Score  over  the  B/W 
treatment.  Overall  the  CRT  treatment  yielded  a  14.2  per 
cent  Increase  In  decision  quality  performance  when 
compared  to  B/W  when  the  decision  makers  were  females. 

The  consistency  was  not  upheld  for  the  Post  Test. 
B/W  and  PAS  switched  ranks,  but  the  others  remained  the 
same.  The  CRT  treatment  shows  a  28.9  per  cent  increase  in 

Post  Test  score  when  compared  to  the  B/W  treatment. 

However,  Tukey-Kramer  did  not  detect  any  significant 

differences  among  any  of  the  treatments,  which  may  account 
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for  the  ranking  inconsistency. 

When  measuring  performance  by  overall  quality  of 
decisions  which  the  Total,  Sum  and  Scene  2&3  quality 
scores  represent,  females  consistently  performed  better 
using  a  CRT  colored  mixed  graphics  display  rather  then  the 
B/W  monochrome  Intensity  pattern  one.  Though  the 
difference  was  not  statistically  significant  at«C  =  .05,  it 
represented  an  average  of  16.5  per  cent  in  overall  (Total) 
score,  11.6  per  cent  in  interactive  (Sum)  score,  14.5  per 
cent  in  memory  Interactive  (Scene  2&3)  score  and  a  28.9 
per  cent  in  long  term  memory  (Post  Test)  score. 

College  Experience.  Those  who  had  between  3  and  less 
than  5  years  college  experience  showed  a  preference  for 
specific  color  treatments  when  measuring  the  overall 
performance  time  (Table  4.11).  Tukey-Kramer  (equal 
variances,  unequal  cell  sizes)  suggested  no  difference  in 
the  average  total  time  between  RND  and  B/W  coded  A  In 
Table  4.11  as  wel 1  as  between  B/W,  PAS,  CRT  and  OUT  coded 
B  in  Table  4.11.  RND  was  the  best  with  B/W  being  second. 
Though  not  statistically  significant  at  .05,  this  group 
worked  quicker  when  interfacing  with  a  chromatic  display. 
Unlike  the  females,  the  RND  schematic  ranked  first 
yielding  a  2.7  per  cent  decrease  in  time  over  the  B/W. 
The  CRT  treatment  represented  a  11.5  Increase  in  time  when 
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compared  to  the  B/W. 

Education  Level.  The  ANOVA  F  statistic  suggested 
those  who  had  more  than  two  years  of  college  and  were  of 
undergraduate  status  were  affected  by  the  color  treatment 
when  they  made  decisions  Involving  extrapolat Ion  from  one 
graph.  However,  Tukey-Kramer  grouped  all  treatments  into 
one  class  (coded  A  In  Table  4.12),  thus  detected  no 
differences  among  the  means.  This  conflict  may  be  caused 
by  the  fact  the  ANOVA  F  statistic  yielded  a  p=.05  and 
Tukey-Kramer  was  not  as  sensitive,  possibly  because  of 
unequal  cell  sizes  to  discriminate  any  differences  at  .05 
significance  level.  None  the  less,  the  CRT  cases  yielded 
a  27.9  per  cent  lower  score  than  the  B/W  cases  while  the 
PAS  showed  a  5.7  per  cent  Increase  over  the  B/W. 
Chromatic  ranked  best. 

A  95%  confidence  Interval  using  the  t  distribution 


was  constructed 

around 

the 

mean  post 

test 

scores  when 

considering  all 

1  evel  s 

of 

education. 

Since 

there  was 

1 nsuff iclent 

ev i dence 

to 

support  the 

data 

were  skewed 

rather  than  normal /bell  shaped  (Table  4.4)  and  estimates 
of  the  unequal  variances  were  available,  the 
t-df str 1  but  Ion  was  used  when  building  the  confidence 
Intervals  shown  In  the  second  table  of  Table  4.12.  These 
confidence  Intervals  are  plotted  In  the  accompanying 
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Table  4.12 

ANALYSIS  of  EDUCATION  LEVEL  vs  COLOR  TREATMENT 

f  Performance  Not  Equal 

Across 

All  Treatments 
At  p< .  05 


Variances  Equal 


.^Color  Factor 

Performance"^- 

PAS 

Best 

B/W 

OUT 

CRT 

RND 

Worst 

QUAL I TY 

Extrap  1  (%) 

49.6 

46.9 

41 . 1 

33.8 

29.0 

A 

A 

A 

A 

A 

Variances  Not  Equal 


graph.  The  graph  indicates  that  two  of  the  intervals  do 
not  intersect:  CRT  (63.5,  71.5)  and  RND  (52.8  and  62.6). 
These  two  may  belong  in  different  groups.  If  Tukey-Kramer 
was  used,  its  grouping  confirms  this  hypothesis.  It 
grouped  CRT  B/W  and  OUT  (coded  A  in  Table  4.12)  as  not 
significantly  (o4=.05)  different  from  each  other  and 
neither  were  B/W  OUT  PAS  and  RND,  coded  B  in  Table  4.12. 
From  the  ranking  of  means,  CRT  color  treatment  was  the 
best,  with  B/W  second  when  measuring  performance  by  a  long 
term  memory  (Post  Test)  score.  Examination  of  the 
confidence  intervals  Indicate  though  the  CRT  and  B/W 
treatments  were  not  significantly  different,  the  CRT 
represented  approx  1 mate  1 y  a  4  per  cent  increase  in 
performance  over  B/W. 

CRT  Experience.  Duncan's  Multiple  Range  test  was 
applied  to  the  data  categorized  by  the  Color  x  CRT 
Experience  vs  Long  Term  Memory  Quality  Score  (Table  4.13). 
It  detected  differences  among  the  means  and  divided  the  5 
equal  size  groups  Into  2  categories.  The  first  included 
(CRT  B/W  RND)  and  the  second  (B/W  RND  OUT  PAS).  CRT 
ranked  first,  with  B/W  in  second  place,  with  CRT 
outscorlng  B/W  by  9.8  per  cent.  These  two,  like  all 
previous  cases  were  not  statistically  significant  from 
each  other. 


Table  4.13 

ANALYSIS  of  CRT  EXPERIENCE  vs  COLOR  TREATMENT 

Performance  Not  Equal 
Across 

All  Treatments 
At  pl.05 


Variances  Equal 


erf  or  mane? 


UAL  I TY 


CRT 

B/W 

RND 

OUT 

PAS 

Best 

Worst 

69.4 

63.2 

61 . 1 

59.0 

58.3 

A 

A 

A 

B 

B 

B 

B 
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Video  Game  Experience.  After  accounting  for  the 
effects  due  to  video  game  experience,  the  color  of  the 
graphics  display  became  significant  when  performance  was 
measured  by  Total  Score,  Sum  Score,  Post  Test  Score, 
Extrapolation  Using  >1  Graph  and  Scenario  1  Score.  The 
first  four  were  further  analyzed  using  Duncan's  Multiple 
Range  since  there  was  insufficient  evidence  to  reject 
homoskedasticity  or  additivity.  The  95%  confidence 
interval  graphical  method  was  applied  for  the  Color  x 
Video  Game  vs  Scenario  1  score  based  on  Levene's  test 
supporting  unequal  variances.  The  results  are  delineated 
in  Table  4.14,  Analysis  of  Video  Game  x  Treatment. 

Controlling  ot =.05,  Duncan's  analysis  indicated  the 
mean  total  score  was  the  same  for  the  CRT,  B/W,  PAS  and 
OUT  color  treatments  as  well  as  for  PAS,  OUT  and  RND .  CRT 
ranked  best  with  B/W  in  second  place  with  no  significant 
difference  between  them.  The  CRT  treatment  only  yielded  a 
.7  per  cent  increase  in  total  performance  over  B/W. 

The  ranking  of  the  sum  score  is  not  consistent  with 
the  total  despite  the  high  correlation  between  the  two 
performance  measures.  The  order  is  PAS,  B/W,  CRT,  OUT  and 
RND  with  the  first  four  not  statistically  different  from 
each  other.  The  last  three  were  not  either  in  accordance 
with  Duncan.  PAS  yielded  the  best  average  score  while  B/W 


Tabl e  4.14 

ANALYSIS  of  VIDEO  GAME  EXPERIENCE  vs  COLOR  TREATMENT 

Performance  Not  Equal 
Across 

All  Treatments 
At  Pi. 05 

Variances  Equal 


UAL  I TY 


CRT 

Best 

B/W 

RND 

OUT 

68.9 

68.4 

67.8 

65.3 

A 

A 

A 

A 

B 

B 

61.6 

B 


UAL  I TY 


urn  (%) 
uncan  Range 


PAS 

Best 

B/W 

CRT 

OUT 

RND 

Worst 

72.6 

71.6 

69.8 

67.8 

63.8 

A 

A 

A 

A 

B 

B 

B 

UAL  I  TY 


ost  Test(%) 
uncan  Range 


CRT 

Best 

B/W 

OUT 

PAS 

RND 

Worst 

67.5 

62.6 

61.3 

59.6 

57.7 

A 

A 

A 

B 

B 

B 

B 

UALITY 


B/W 

Best 

PAS 

OUT 

CRT 

RND 

Worst 

72.5 

A 

70.3 

A 

61.4 

A 

56.9 

A 

53.3 

A 
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Table  4.14  (Cont. ) 

ANALYSIS  of  VIDEO  GAME  EXPERIENCE  vs  COLOR  TREATMENT 

Performance  Not  Equal 
Across 

A1 1  Treatments 
At  p<.05 


i  Variances  Not  Equal 


B/W 

PAS 

CRT 

OUT 

RND 

Best 

Worst 

QUALITY 

Scene  1  (%) 

87.9 

86.3 

81.7 

79.5 

75.7 

95%  Conf  Int 

(80,95) 

(78,94) 

(74,88) 

(69,90) 

(66,85) 

TUKEY-KRAMER 

A 

A 

A 

A 

A 

Score 

(%) 

100 
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rated  second.  PAS  represented  only  a  1.3  per  cent 
increase  in  interactive  (Sum)  score  over  B/W,  with  CRT 
showing  only  a  2.5  per  cent  decrease  in  performance  with 
respect  to  B/W. 

The  average  post  test  scores  were  also  separated  into 
two  intersecting  groups.  Duncan  Included  CRT,  B/W  and  OUT 
in  the  better  group  and  B/W,  OUT,  PAS  and  RND  in  the 
second  one.  As  in  the  Total  Score  performance,  CRT  ranked 
highest  and  was  not  statistically  different  from  B/W  which 
came  in  second  best  for  the  Post  Test  Score.  CRT  yielded 
a  7.8  per  cent  increase  in  the  long  term  memory  (Post 
Test)  score  over  B/W. 

After  accounting  for  the  effects  of  video  game 
experience,  color  was  considered  significant  based  on  the 
ANOVA  F  at  p=.05  when  performance  was  measured  by  the 
Extrapolat ion  Using  >1  Graph.  However  Duncan  did  not 
discriminate  any  significant  differences  among  any  of  the 
treatment  means.  However  B/W  ranked  first,  representing 
an  Increase  in  the  Extrapolation  >1  Graph  score  of  3.0  per 
cent  over  PAS,  15.3  per  cent  over  OUT  and  21.5  per  cent 
over  CRT. 

When  analyzing  the  graphed  957.  confidence  intervals 
computed  for  the  Scenario  1  Score  means,  there  was  not  an 
obvious  split  among  groups.  The  (80.9,  85.2)  score 


interval  was  contained  in  all  of  them. 


Also,  if  the 


Duncan  and  Tukey-Kramer  tests  were  applied  as  estimates  of 
|  the  differences,  neither  reported  significant  differences 

among  any  of  the  Scenario  1  Score  means,  which  supports 
the  graphical  analysis.  Similar  the  the  Extrapolation  >1 
i  Graph  score  Just  discussed,  achromatic  outscored  chromatic 

with  the  ranking  order  of  B/W,  PAS,  CRT,  OUT  and  RND.  B/W 
outscored  PAS  by  about  1  per  cent  and  CRT  by  approximately 
by  4  per  cent. 

After  accounting  for  the  variability  in  performance 
caused  by  video  game  experience,  when  in-depth  analysis 

1 

I  indicated  difference  among  treatments,  chromatic  displays 

consistently  ranked  highest  with  B/W  in  second  place.  The 
difference  was  always  statistically  Insignificant  at  «t= 
I  .05,  with  the  average  increase  in  score  only  being  4.9  per 

cent.  CRT  showed  an  increase  of  .7  per  cent  in  overall 
(Total)  decision  quality  over  B/W;  PAS  a  1.3  per  cent 
Increase  In  Interactive  (Sum)  decision  quality  over  B/W, 
and  CRT  a  7.8  per  cent  Increase  in  long  term  memory  (Post 
Test)  score  over  B/W.  Though  the  differences  were  never 
significant  and  the  increase  in  decision  quality  small,  as 
measured  by  scores  comprised  of  all  types  of  decisions, 
performance  was  best  when  chromatic  displays  were  the 
interfaces.  The  CRT  ranked  first  twice  and  PAS  ranking 
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first  once.  The  B/W  monochrome  intensity  pattern 
consistently  was  second  in  performance. 

Conclusions 

The  research  question  addressed  in  this  chapter  was 
how  the  attribute  of  color,  when  used  functionally  in 
computer  generated  mixed  graphics  displays  affects  the 
performance  of  a  decision  maker  who  has  an  academic 
engineering  background.  Five  color  treatments  were  used 
in  a  laboratory  experiment.  One  treatment  was  achromatic 
consisting  of  black  and  white  computer  generated  intensity 
patterns.  The  other  four  were  chromatic,  representing 
possible  variations  of  CRT  color  outputs.  These 
treatments  were  functionally  Incorporated  into  hardcopy 
mixed  graphics  displays  used  by  120  subjects  enrolled  in 
the  College  of  Engineering,  ASU  to  complete  a  one  hour 
managerial  planning  deterministic  decision  making  case 
study.  Potential  variations  in  data  due  to  seven 
different  characteristics  of  the  subjects  were  controlled 
prior  to  analysis  of  the  research  question.  Performance 
was  measured  as  a  function  of  time  (3  levels)  and  as  a 
function  of  decision  quality  (12  levels).  As  a  result, 
111  hypotheses  and  ANOVA  models  were  formulated  and 
validated  for  rigorous  analysis  of  the  effects  of  color  on 
performance.  These  111  cases  are  shown  in  Table  4.8, 


Analysis  of  Main  Effect  of  Color. 


Of 

the 

111  tests,  98 

resulted  in 

insuff icient 

evidence 

to 

cone  1 ude  that  a 

s 1 gn 1 f 1  cant 

difference  in 

performance  occurred  among  any  of  the  five  treatments. 
Even  after  the  affects  of  characteristic  factors  such  as 
age,  sex,  college  level  and  experience,  attitude  toward 
the  computer,  computer  experience  and  video  game 
experience  were  taken  Into  account  In  the  variation  of  the 
mean  performance,  the  color  attribute  did  not  test 
statistically  significant  at  the  .05  significance  level. 

For  those  13  cases  listed  in  Table  4.9,  Hypotheses 
Rejected  at  oi  -  .05,  preliminary  analysis  indicated  the 

data  contained  evidence  that  supported  that  the  attribute 
of  color  In  the  mixed  graphics  display  did  affect  the 
performance  of  decision  makers.  For  11  of  these,  shown  in 
Table  4.15,  Ranking  of  Treatments,  in-depth  analyses 
further  supported  differences  in  performance  attributable 
to  color  treatment.  The  table  lists  the  rank  order  across 
the  top,  from  best  performance  to  worst.  The  first  column 
lists  the  characteristic  factors  and  performance  measures 
of  the  hypothesis  of  equal  mean  performance  which  were 
rejected  at  the  .05  significance  level.  The  entries  in 
the  table  are  the  codes  representing  the  color  treatment 
of  the  mixed  graphics  display.  They  are  ranked  according 
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Table  4.15 

RANKING  of  TREATMENTS 
by 

PERFORMANCE 

Groupings  of  Duncan/Tukey-Kramer 
X  *  .05 


- — . _ Color  Ranking 

Characteristic  vs  PerfonTT 


Female  vs  Total  Score 
Female  vs  Sum  Score 
Female  vs  Scene  2&3 
Col  Ex  vs  Total  Time 
Edu  Lv  vs  Post  Test 
CRT  Ex  vs  Long  Mem 

V  Game  vs  Total  Score 

V  Game  vs  Sum  Score 

V  Game  vs  Post  Test 


Best 

1 


Worst 

5 


CRT 

B/W 

PAS 

RND 

OUT 

CRT 

B/W 

PAS 

RND 

OUT 

CRT 

B/W 

PAS 

RND 

OUT 

RND 

B/W 

PAS 

CRT 

OUT 

CRT 

B/W 

OUT 

PAS 

RND 

CRT 

B/W 

RND 

OUT 

PAS 

CRT 

B/W 

PAS 

OUT 

RND 

PAS 

B/W 

CRT 

OUT 

RND 

CRT 

B/W 

OUT 

PAS 

RND 

to  the  top  line.  The  underlining  of  each  group  represents 
the  grouping  of  equal  means  suggested  by  Duncan  or 
Tukey-Kramer,  as  discussed  earlier. 

Analyzing  the  content  of  the  table  suggests  several 
trends.  If  there  was  evidence  of  significance,  the  means 
were  all  divided  into  only  two  intersecting  subgroups. 
The  top  ranking  treatment  was  consistently  chromatic,  with 
the  second  always  the  monochrome  intensity  pattern  (B/W) . 
Except  for  the  Color  x  Video  Game  vs  Total  Score  or  Sum 
Score,  the  monochrome  pattern  was  common  to  al 1  subgroups, 
consistently  being  the  second  best  in  the  better  group  and 
top  in  the  second  group.  Of  the  chromatic  treatments,  the 

CRT  colors  (CRT)  ranked  the  highest  most  often  while  the 

pastel  colors  (PAS)  were  middle  of  the  road  and  the  random 
CRT  (RND)  and  outline  (OUT)  ranked  at  the  worst  end.  For 

the  seven  cases  where  CRT  outscored  the  B/W,  all  which 

were  a  function  of  quality  of  decision,  the  average 
Increase  in  performance  was  9.2  per  cent.  However  in  the 
five  cases  where  B/W  outscored  CRT,  the  average  decrease 
In  score  was  about  the  same,  13.9  per  cent.  Of  particular 
note  is  that  model  which  had  decision  time  as  the 

performance  measure.  B/W  decreased  decision  time  over  CRT 
by  11.5  per  cent . 

Overal 1 ,  based  on  the  results  of  102  of  111  tested 
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hypotheses,  the  attribute  of  color  when  functionally 
Integrated  in  a  mixed  graphics  display  did  not 
significantly  affect  performance  in  either  direction. 
Quality  of  decision  was  not  improved  or  degraded  at  the 
.05  significance  level,  nor  was  decision  time.  The  data 
did  not  contain  sufficient  evidence  to  conclude  the 
chromatic  colors  have  a  unique  quality  for  functionally 
coding  information  in  a  DSS  over  a  monochrome  intensity 
pattern.  However,  If  color  is  Incorporated  functionally 
into  a  DSS  whose  interface  is  hardcopy,  the  CRT  color  set 
appears  to  be  the  best  in  general.  However,  it  should  not 
be  assumed  the  performance  of  the  decision  maker  with  an 
academic  engineering  background  will  be  s igni f icant 1 y 
affected. 

Conclusions  and  generalizations  based  on  these 
results  and  the  literature  as  well  as  recommendations  for 
further  research  in  this  area  are  offered  in  the  next 


chapter 


V.  CONCLUSIONS  AND  RECOMMENDATIONS 


Introduction 

Computer  generated  graphics  supporting  the  option  of 
color  have  become  the  heart  of  many  DSS  In  the  industrial, 
business  and  military  sectors.  However  knowledge  of  how 
to  integrate  this  color  technology  to  realize  maximum 
benefit  to  the  performance  of  the  decision  maker  has  not 
kept  pace  with  the  technological  advances  of  the  software 
and  hardware  supporting  it.  The  design  objective  of  a  DSS 
from  a  manager's  perspective  is  to  discover  the  optimum 
match  between  the  DSS  and  the  decision  maker  that  allows 
for  more  efficient  and  effective  performance.  The  primary 
concern  of  the  research  reported  here  was  to  Investigate 
the  relative  effectiveness  on  performance  when  Integrating 
color  functionally  into  a  DSS  whose  decision  maker  has  an 
academic  engineering  background,  solves  structured 
problems  based  on  deterministic  data  presented  in  a  mixed 
graphics  form  on  computer  generated  hardcopy  In  a 
managerial  role  of  control  of  resources. 

The  MIS  literature  advocates  empirical  research  is 
desperately  needed  in  this  area  and  recommends  the  small 
study  (Lucas,  1981)  laboratory  (Dickson  et  al.,  1977) 
approach.  Since  it  was  determined  the  objective  of  this 
study  could  be  satisfied  within  the  framework  of  a 
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laboratory  setting,  an  experiment  was  designed  using  this 
recommended  approach.  The  key  factors  of  a  DSS  definition 
as  given  by  Mason  and  Mitroff  (1973)  determined  the 
elements  of  the  experiment.  The  performance  measured  were 
two  resources  most  coveted  by  decision  makers:  time  and 
quality  of  decision.  These  two  variables  were  subdivided 
into  15  categories  for  more  rigorous  analysis  of  the 
research  question.  Three  categories  were  a  function  of 
time;  12  were  a  function  of  quality  of  decision.  Seven 
characteristics  of  decision  makers  that  may  have 
influenced  performance  were  included  in  the  design.  These 
included  age,  sex,  college  level  and  experience,  attitude 
toward  the  computer,  and  experience  with  computers  and 
video  games.  The  case  study  used  as  the  instrument 
represented  a  general  class  of  structured  problem  of 
managerial  resource  control.  The  mixed  graphics  form  was 
bar  charts  that  are  widely  used  on  existing  graphical  DSS. 
Five  alternatives  of  colors  were  functionally  integrated 
Into  the  graphics  for  comparison.  Four  of  these 
alternatives  were  chromatic  and  included:  the  colors 
preset  on  a  majority  of  CRTs  (CRT),  these  same  colors  but 
in  outline  as  opposed  to  filled  form  (OUT),  a  set  of 
colors  randomly  selected  from  all  colors  available  on  a 
majority  of  CRTs  (RND)  and  a  set  of  randomly  selected 
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pastel  colors  (PAS).  The  fifth  alternative  was  achromatic 
or  monochrome  and  was  comprised  of  randomly  selected 
intensity  patterns  (B/W)  available  in  many  commercial 

computer  graphics  software  packages.  The  man-machine 

interface  employed  was  computer  generated  hardcopy. 

Based  on  the  overall  objective  of  this  research  and 
the  variables  designed  into  the  experiment  as  discussed 
above,  hypotheses  were  formalized.  The  data  collected 
from  the  performance  of  120  subjects  were  used  to  define 
the  appropriate  mathematical  models  whose  derived 
statistics  were  used  to  rigorously  test  the  stated 

hypotheses. 

This  chapter  initially  reiterates  the  results  of 

examining  the  problem  statement  via  analyses  of  the  111 
hypotheses.  This  is  accomplished  by  offering  conclusions 
and  summaries  first  by  Individual  performance  variable, 
then  by  performance  variable  type  (decision  quality  and 
time)  and  finally  by  generalizing  to  the  overall  problem 
statement  and  objectives  of  this  research. 
Recommendations  based  on  recognized  limitations  of  this 
research  and  direction  from  as  well  as  voids  in  the  MIS 
1 Iterature  finish  this  chapter  and  conclude  this  research. 

Conclusions  and  Summary 

The  following  conclusions  and  summaries  are  based  on 


the  results  of  rigorous  analyses  of  the  appropriate 
statistics  derived  from  the  2-factor  fixed  effect  ANOVA 


model.  The  particular  statistics  used  and  their 
Just  if i cat  Ion  based  on  the  characteristics  of  the  model  as 
well  as  the  data  are  detailed  in  Chapter  IV,  Analysis  and 
Results.  The  final  results  are  consolidated  in  Table  5.1, 
Final  Results  of  Analysis  of  Color.  The  entries  in  this 
table  are  the  p  values  of  the  models  whose  performance 
variable  starts  the  row  and  whose  color  and  character i st t c 
factors  head  the  column.  The  p  value  entries  are  included 
on  1 y  for  those  mode  Is  which  i n-depth  ana  1 yses  v i a  Duncan , 
Tukey-Kramer  or  graphical  inspection  indicated  sufficient 
evidence  to  reject  the  nul I  hypothesis  that  performance 
was  not  affected  by  the  color  alternative  of  the  mixed 
graphics  display  at  the  .05  significance  level. 

The  conclusions  and  summaries  are  presented  as 
follows.  First  those  models  whose  performance  was  a 
function  of  decision  quality  are  discussed  with  a  general 
summary  offered.  Next  the  models  whose  performance  was  a 
function  of  time  are  presented  then  summarized.  Overall 
conclusions  and  generalizations  close  this  section. 

Decision  Quality  Score 

Total  Score.  Total  score  represented  the  overal 1 
decision  quality.  There  was  insufficient  evidence  to 


Tab  I e  5 . 1 

FINAL  RESULTS  of  ANALYSIS  of  COLOR 

p  Values  of  models 
Equal  Performance  Rejected  at *(=.05 


^s^Factors  Color  Color  Color  Color  Color  Color  Color 

Nv  &&&&&&& 

Age  Sex  Col  Ex  Edu  Lv  CRT  Ex  Atttt  VGame 

Performance 


*  Females  Only 

#  3£5  Yrs  Col  Exp  Only 


conclude  that  color  Influenced  the  overall  decision 

quality  for  all  models  except  two.  Females'  overall 
decision  quality  score  ranked  best  to  worst  when 

Interfacing  with  CRT,  B/W,  PAS,  RND  and  OUT  respectl vely. 
In-depth  analysis  showed  the  first  four  not  significantly 
different  as  well  as  the  last  four.  When  video  game 

experience  was  washed  out,  the  ranking  was  similar  to 
females:  CRT,  B/W,  PAS,  OUT  and  RND.  The  first  four  were 

insignificant  as  were  the  last  three. 

Overa 1 1  tota l  dec i s i on  qua 1 i ty  was  not  affected  by 
color  of  the  graphics  display.  When  there  was  evidence  to 
suggest  a  significant  difference,  the  rankings  were 
similar:  CRT,  B/W  and  PAS.  However,  there  was 

insufficient  evidence  to  suggest  chromatic  (CRT)  which 
ranked  best  was  significantly  better  than  monochrome 
(B/W),  ranked  second.  The  actual  average  increase  of  CRT 
score  over  B/W  was  8.6  per  cent. 

Sum.  Sum  score  represented  quality  of  decisions 
based  on  information  which  was  either  currently  or 
previously  available  to  the  decision  maker.  As  with  the 
total  score,  for  all  but  the  same  two  models,  there  was 
insufficient  evidence  to  conclude  color  significantly 
affected  performance.  This  was  expected  since  sum  score 
is  a  subset  of  total  score.  Females'  performance  were 
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consistent  in  Its  differential  ranking:  CRT,  B/W,  PAS, 
RND  and  OUT.  In-depth  analysis  grouped  the  first  four  as 
a  set  and  the  last  four  as  a  set.  Chromatic  (CRT)  was  not 
significantly  different  from  monochrome  (B/W)  though  it 
outscored  monochrome  by  11.6  per  cent. 

After  washing  out  the  variability  in  performance  due 
to  video  game  experience,  decision  makers  sum  score  ranked 
best  to  worst  for  PAS,  B/W,  CRT,  OUT  and  RND.  In-depth 
analyses  grouped  the  first  four  as  insignificant  from  one 
another  and  the  last  three  as  i ns fgnif leant  from  each 
other.  Chromatic  (PAS)  was  not  significantly  different 
from  monochrome  <B/W)  though  it  outscored  it  by  1.3  per 
cent.  Unlike  all  above  discussions,  the  PAS  alternative 
ranked  first.  CRT,  which  previously  ranked  first  was 
outscored  by  B/W  by  2.5  per  cent,  which  was  not 
stat i st i ca 1 1 y  s 1 gn i f i cant . 

Scene  1 .  Scene  1  quality  score  represented  decisions 
in  which  no  memory  was  required.  In-depth  analysis 


indicated  insufficient  evidence  to  concluded  significant 
differential  affect  on  no  memory  decision  quality  by  any 


of  the  treatments. 


Scene 


L.  Scene  2  &  3  quality  score  represented 


decisions  requiring  information  that  was  present  for 
review  as  well  as  memory  of  what  was  presented  or  decided 


during  the  previous  scenario(s).  All  models  except  female 
performance  showed  indifference  to  the  color  of  the 
graphics  displays.  Females  performed  these  types  of 
decisions  best  to  worst  when  interfacing  with  CRT,  B/W, 
PAS,  RND  and  OUT  respectively.  This  ranking,  which 
in-depth  analysis  showed  the  first  four  insignificant  from 
each  other  as  well  as  the  last  four,  was  consistent  with 
previous  rankings  by  female  performance.  Chromatic  (CRT) 
outscored  monochrome  by  14.5  per  cent  which  was  not 
stat 1 st i ca 1 1 y  s i gn 1 f 1  cant . 

Post  Test.  Post  test  quality  of  decision  represented 
decisions  for  which  none  of  the  graphics  displays,  problem 
facts  or  previous  information  were  available  and  the 
session  was  not  timed.  It  required  memory  and/or 
manipulation  of  previously  given  information  or  decisions. 
After  in-depth  analyses  were  accomplished,  only  two  of  the 
eight  models  contained  evidence  to  suggest  the  color  of 
the  graphics  display  affected  post  test  decision 
performance:  those  which  accounted  for  the  variability 

due  to  education  level  and  for  video  game  experience. 
Both  models  were  consistent  with  each  other  in  their 
respective  rankings  of  color  based  on  post  test 
performance:  CRT,  B/W,  OUT,  PAS  and  RND.  The  first  three 


were  grouped  as  insignificant  as  were  the  last  four 
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After  the  variabi 1 ity  in  performance  attributed  to 
education  level  or  video  game  experience  was  accounted 
for,  chromatic  (CRT)  out scored  monochrome  (B/W)  by 

approximately  4  per  cent  and  7.8  per  cent  respectively. 
This  represented  an  average  increase  of  approximately  5.9 
per  cent  in  post  test  score  using  CRT  over  B/W.  However 
this  difference  was  not  statistically  significant  at  the 
.05  level.  The  other  six  models  whose  performance 
measured  post  test  quality  score  contained  Insufficient 
evidence  to  support  any  of  the  five  alternatives  were  more 
effective  than  any  of  the  others. 

0  Read  > 1 .  Dfrect  read  from  more  than  one  graph 
represented  decisions  requiring  the  processing  of 
information  from  at  least  two  graphs  that  were  available 
for  review  by  the  decision  maker.  None  of  the  eight 

models  investigated  were  supported  by  data  which  suggested 
color  differentially  affected  this  type  of  performance. 

Extrap  1 .  Extrapolation  from  one  graph  represented 
that  type  of  decision  which  required  manipulation  of  the 
information  from  one  graphics  display  which  was  available 
for  review  by  the  decision  maker.  In-depth  analysis 

Indicated  Insufficient  evidence  to  conclude  color 
differential ly  affected  this  type  of  decision  performance. 


Extrapolation  from  more  than  one  graph 


was  comprised  of  decisions  which  required  computation  of 


Information  extracted  from  two  or  more  graphs  but  did  not 
require  memory.  In-depth  analyses  resulted  In  falling  to 
reject  that  performance  based  on  all  five  treatments  was 
the  same. 

Short  Mem .  Short  term  memory  score  represented 
dec  1 s 1 on  qua 1 1 ty  which  mandated  memory  of  prev 1 ous 1 y 
presented  Information  and/or  decisions.  Computation  was 
not  required.  The  case  study,  current  Information  and 
graphics  displays  were  available  for  review  by  the 
decision  maker.  For  all  models  under  consideration,  there 
was  Insufficient  evidence  to  conclude  that  a  particular 
color  differentially  affected  a  decision  maker's  short 
term  memory  quality  of  decision  performance. 

Long  Mem.  Long  term  memory  decisions  required  memory 
of  previously  given  Information  or  stated  decisions.  No 
computations  were  necessary.  Unlike  the  short  term  memory 
decisions,  neither  the  problem  facts  or  any  graphics 
displays  were  available  for  review.  This  Is  the  first  of 
the  decision  types  that  was  not  timed.  Except  for  the 
model  which  accounted  for  the  variation  in  performance  due 
to  CRT  experience,  there  was  insufficient  evidence  to 
conclude  the  color  of  the  graphics  display  significantly 
Influenced  performance  which  required  long  term  memory. 
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attention  of  the  decision  maker. 

In-depth  analysis  showed,  that  after  accounting  for 
the  effects  on  performance  attributed  to  CRT  experience, 
the  color  of  the  graphics  display  did  affect  long  term 
memory  performance.  Performance  was  best  to  worst  when 
the  displays  were  CRT,  B/W,  RND,  OUT  and  PAS  respectively. 
The  first  three  were  grouped  as  being  insignificant  from 
each  other  as  were  the  last  four.  Chromatic  (CRT) 
out scored  monochrome  (B/W)  by  9.8  per  cent,  which 
statistically  did  not  represent  a  significant  difference. 

Long  Mem  Extrap .  Long  term  memory  extrapolation  type 
decisions  represented  that  class  of  decisions  which 

requires  manipulation  of  information  or  decisions 
previously  given.  Neither  the  basic  facts  or  any  of  the 
graphics  displays  were  accessible  to  the  decision  maker. 
These  types  of  decisions  were  not  timed  and  represented 
the  end  of  the  decision  making  session.  Across  the  board, 
there  was  insufficient  evidence  to  reject  the  hypothesis 
that  any  of  the  five  color  alternatives  varied  from 
another  when  quality  of  decision  was  measured  by  long  term 
memory  performance.  This  may  also  indicate  that  none  of 


the  alternatives  were  more  influential  on  maintaining 


subject  attention  better  than  another  set. 

Summary  Based  on  DecJslon  Quality 

Of  the  86  models  that  were  examined  whose  performance 
was  a  function  of  quality  of  decision,  the  majority  (78) 
of  them  resulted  in  insufficient  evidence  to  conclude  the 
color  attribute  of  the  mixed  graphics  display 
significantly  affected  performance  at  the  .05  significance 
level.  These  results  are  inconsistent  with  the 
theoretical  and  subject  literature,  which  purport  color 
when  Integrated  Into  information  display  allows  for  better 
decisions  and  memory.  However,  these  findings  are 
concurrent  with  the  objective  literature  which  was 

summarized  by  stating  chromatic  color  does  not  appear  to 
have  a  unique  quality  for  coding  information  over  other 
achromatic  codes  (Christ  and  Corso,  1983).  For  some 

tasks,  it  may  be  beneficial;  in  others  it  may  be 

detrimental.  In  this  research,  chromatic  color  did  not 

influence  performance  in  a  way  different  than  the 
achromatic  code  for  a  majority  of  the  models.  Where  it 
did,  the  Influence  was  always  in  the  positive  direction  as 
it  consistently  ranked  better  than  achromatic,  though 
never  with  an  Interval  which  was  statistically  significant 
at  the  .05  significance  level. 


Unlike  any  other  study  reported  in  the  literature, 
this  research  employed  several  chromatic  alternatives  in 
the  experimental  design.  This  was  done  to  investigate  if 
It  was  just  the  presence  of  color  that  affected 
performance  or  a  particular  set.  As  discussed  previously, 
decision  quality  performance  was  not  affected  differently 
by  any  of  the  five  alternatives.  Based  on  these  results, 
it  can  be  concluded  that  despite  the  color  set  used, 
performance  was  not  affected  differently  than  when  a 
monochrome  intensity  pattern  was  used. 

After  examining  the  eight  models  whose  data  contained 
sufficient  evidence  which  suggested  chromatic  color 
affected  decision  quality  performance,  a  pattern  was 
detected.  In  seven  of  the  eight  models,  the  CRT 
alternative  correlated  with  the  best  performance,  giving 
an  average  of  approximately  9.2  per  cent  increase  in 
decision  quality  score  over  the  second  place  monochrome 
intensity  pattern.  This  difference  was  not  statistically 
significant.  For  the  eighth  case,  PAS  ranked  best, 
showing  a  1.3  per  cent  increase  over  monochrome.  Based  on 
these  results  it  is  concluded  that  if  a  color  alternative 
is  chosen,  the  data  suggest  the  colors  preset  on  the 
majority  of  CRTs  be  the  ones  used  to  maximize  performance. 
However,  since  not  all  nor  a  random  sample  from  all  color 
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alternatives  available  for  graphics  displays  were 
investigated,  the  generalization  of  this  conclusion  Is 
limited.  When  choosing  among  the  four  chromatic 
a  1 ternat i ves ,  quality  of  performance  was  most  often 
positively  affected  by  the  colors  currently  preset  on  a 
majority  of  CRTs  supporting  color:  green,  red,  ye l low, 
blue,  dark  blue,  cyan,  black  and  white. 

Decision  Time 

Total .  Total  or  Interactive  time  represented  the 
time  (In  seconds)  needed  to  make  those  decisions  where  the 
basic  problem  facts  and  graphics  displays  were  available 
for  review.  Eight  of  the  nine  models'  data  considered 
contained  insufficient  evidence  to  conclude  chromatic 
displays  differentially  affected  the  over  all  time 
required  to  make  decisions.  For  those  decisions  makers 
who  had  at  least  three  but  less  than  five  years  college 
experience,  decision  time  was  shortest  when  the  visual 
display  was  functionally  colored  with  the  RND  alternative, 
with  B/W,  PAS,  CRT  and  OUT  being  the  remaining  ranking 
order.  In-depth  analysis  showed  RND  out scored  B/W  by  2.7 
per  cent,  while  B/W  shortened  total  time  by  11.5  per  cent 
over  CRT.  RND  and  B/W  were  not  significantly  different 
from  each  other,  nor  were  B/W,  PAS,  CRT  and  OUT. 

No  Memory.  No  memory  time  was  the  time  (in  seconds) 


elapsed  when  formulating  decisions  which  required  no 
memory  but  extraction  of  information  currently  available 


to  the  decision  maker.  Results  from  analyses  of  the  no 
memory  time  data  were  consistent  across  every  case. 
Functional  chromatic  Information  code  did  not  affect 
decision  time  when  compared  to  monochrome  intensity 
pattern . 

Memory.  Memory  time  was  the  time  (In  seconds)  spent 
by  the  decision  maker  when  making  decisions  which  required 
memory  for  some.  Intermingled  with  others  which  did  not. 
The  problem  facts  and  current  graphics  displays  were 
available  for  consulting,  but  the  necessary  information 
was  not  always  currently  available.  Analysis 

overwhelmingly  concluded  the  color  of  the  graphics  display 
did  not  significantly  affect  memory  time. 

Summary  Based  on  Decision  Time 

Of  the  25  models  that  were  examined  whose  performance 
was  a  function  of  decision  time,  all  but  one  resulted  in 
Insufficient  evidence  to  conclude  the  color  attribute  of 
the  mixed  graphics  display  significantly  affected  decision 
time  at  the  .05  significance  level.  Like  the  results 
based  on  decision  quality,  these  are  Inconsistent  with  the 
theoretical  and  subjective  literature  which  contend  color 
significantly  speeds  the  information  processing  thus 
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decision  time.  The  single  case  in  which  in-depth  analyses 
indicated  a  difference  on  performance  attributed  to  color. 


the  best  chromatic  set  decreased  decision  time  by  only  2.7 
per  cent,  but  monochrome  showed  a  decrease  in  decision 
time  by  11.5  per  cent  over  the  suggested  chromatic  scheme 
of  CRT  based  on  decision  time  performance.  Based  on  this, 
it  is  concluded  that  chromatic  code  does  not  positively 
influence  a  decision  maker's  performance  time,  whether 
this  time  is  measured  as  an  overall  time  or  time  required 
to  make  no  memory  or  a  mix  of  no  memory /memory  decisions. 
Overall  Conclusions  Generalized  to  Problem  Statement 

The  primary  objective  of  this  research  was  to 
investigate  the  relative  effectiveness  on  performance  when 
functionally  integrating  color  into  a  DSS  whose  decision 
maker  has  an  academic  engineering  background,  solves 
structured  problems  based  on  deterministic  data  presented 
in  a  mixed  graphics  form  on  a  computer  generated  hardcopy 
In  a  managerial  role  of  control  of  resources. 

Overall,  based  on  the  results  of  102  of  111  tested 
hypotheses,  the  attribute  of  color  when  functionally 
integrated  in  mixed  graphics  hardcopy  display  did  not 
significantly  affect  performance  in  either  direction  of  a 
decision  maker  with  an  engineering  background.  Quality  of 
decision  was  not  improved  or  degraded  significantly  at  the 
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.05  significance  level,  nor  was  decision  time. 

In  the  nine  models  where  the  data  contained  evidence 
that  the  color  alternative  affected  performance,  the  CRT 
alternative  ranked  best  in  seven  instances,  all  which  were 
a  function  of  decision  quality.  The  average  improvement 
was  approximately  9.2  per  cent.  Specifically,  females 
performed  best  when  interfacing  with  the  CRT  alternative, 
but  It  was  never  a  significant  improvement  over  the 
monochrome  (B/W)  Intensity  pattern. 

In  the  other  two  models  where  a  difference  in 
performance  was  suggested,  a  chromatic  code  ranked  first, 
but  it  varied  (RND,  PAS)  and  was  not  CRT.  The  improvement 
was  never  significant  with  respect  to  the  monochrome. 
Monochrome  intensity  pattern  always  ranked  second.  It 
showed  a  11.5  per  cent  improvement  in  performance  time  and 
only  a  2.5  per  cent  improvement  for  sum  quality  score  over 
the  CRT  alternative. 

In  general,  these  results  are  concurrent  with  other 
empirical  studies  on  the  effects  of  color  when  integrated 
into  visual  displays.  The  data  do  not  contain  sufficient 
evidence  to  conclude  the  chromatic  colors  have  a  unique 
quality  for  functionally  coding  information  in  DSS  over  a 
monochrome  intensity  pattern.  However  if  color  is 


functionally  added  to  a  hardcopy  mixed  graphics  display 
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the  preset  colors  available  on  a  majority  of  the  current 
CRTs  appear  to  be  the  best  overall.  However,  the  manager 
should  not  assume  increased  efficiency  in  a  decision 
maker's  performance  because  of  the  functional  color 
attribute. 

Areas  for  Further  Research 

The  DSS  Is  composed  of  five  key  factors:  a  person  of 
a  certain  psychological  type,  a  class  of  problems,  data  of 
a  certain  form,  a  particular  man-machine  interface  and  an 
organizational  setting.  This  research  is  limited  in  scope 
to  the  decision  maker  with  an  academic  engineering 
background,  who  solves  deterministic  problems  whose  data 
is  presented  in  mixed  graphics  form  on  a  hardcopy 
interface  when  assuming  a  managerial  control  of  resources 
role.  Changing  any  one  factor  opens  the  door  for  further 
research.  Based  on  the  recommendations  of  the  MIS 
1 iterature  that  the  approach  to  research  be  that  of 
smaller  studies  with  emphasis  on  the  laboratory  setting, 
the  users  of  DSS  supporting  color,  the  technology  of  the 
color  itself  and  the  limitations  of  this  research,  the 
following  recommendations  are  made  for  further  research. 
RecpmmendatJQfis 

I.  The  CRT  is  gaining  as  the  primary  man-machine 
Interface  in  the  DSS  with  the  user  friendly,  available  and 
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affordable  computer  system  technology.  Color  most  likely 
will  become  a  standard  option  at  a  cost  not  much  different 
from  that  of  monochrome.  How  does  color  affect  the 
performance  of  the  decision  maker  whose  interface  Is 
primarily  the  CRT?  If  differences  exist,  are  they  the 
same  for  either  type  of  monochrome  (black/whlte  and  green 
phosphorous)  CRT  Interface?  Performance  as  a  function  of 
memory  will  be  an  Issue  since  once  the  screen  Is  erased 
for  the  next  graphics  output,  the  first  Is  not  available 
as  In  hardcopy.  Performance  as  a  function  of  time  may 
also  be  a  key  Issue.  If  memory  is  enhanced  by  color, 
fewer  graphics  will  have  to  be  recalled  to  the  screen  for 
reexamination,  thus  the  decision  time  shortened.  With  the 
dark  background  of  most  CRTs  as  opposed  to  the  white  of 
the  hardcopy,  and  the  horizontal  eye  level  of  the  CRT 
versus  the  45  degree  to  80  degree  angle  for  hardcopy  on 
the  desk,  the  Impact  of  color  may  vary  with  Interface. 

2 .  Though  qua  1 1 ty  of  dec  1 s 1  on  and  1 1 me  are  the  two 
most  coveted  resources  of  most  decision  makers,  others 
have  high  utility.  Does  color  in  the  DSS  make  the  job 
more  exciting  thus  give  the  decision  maker  added 
motivation?  The  subjective  literature  purports  color 
commands  a  longer  attention  span.  Does  it?  The 
subjective  literature  also  claims  color  causes  less 


fatigue.  Does  color  reduce  fatigue  In  a  short  Interval  or 
over  an  extended  period  of  time?  Does  color  affect  the 
performance  differently  than  monochrome  Irrespective  of 
the  fatigue  of  the  decision  maker  prior  to  Interfacing 
with  the  DSS? 

3.  If  the  amount  of  viewing  time  for  the  mixed 
graphics  display  is  limited,  in  what.  If  any,  situations 
does  color  aid  the  decision  maker?  This  study  was 
decision  maker  paced,  but  not  all  situations  are. 

4.  Christ  (1977)  criticized  that  all  the  research  on 
color  to  date  was  accomplished  in  an  isolated  laboratory 
setting  in  which  the  inexperienced  subject  dedicated  total 
attention  to  performing  simple  discrete  task(s).  This 
research  employed  a  complex  task,  similar  to  one  which 
most  subjects  had  previously  performed  on  their  own.  In 
their  workplace  or  In  an  academic  setting.  However, 
unlike  the  decision  maker's  environment,  the  task  was 
accomplished  In  a  testing,  uninterrupted  classroom.  The 
environment  allowed  total  energies  to  be  dedicated  to  the 
task.  Recommend  a  similar  study  be  accomplished  which 
Includes  in  Its  design  Interruptions  that  simulate  the 
decision  maker's  workforce  environment. 

5.  This  study  sampled  a  population  within  an 
academic  environment.  The  thought  processes  and 
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strategies  when  solving  the  case  study  In  a  classroom 
testing  environment  may  be  different  from  those  In  the 
engineering  workforce.  Recommend  analysis  of  the  affects 
of  color  on  performance  of  decision  makers  currently  In 
the  workplace  who  are  not  Involved  In  academic  training. 
Special  attention  in  the  analyses  should  be  given  to  the 
different  ages*  experience  levels  and  strata  of  management 
as  each  may  use  the  same  DSS  from  a  different  perspective. 

6.  Are  decision  makers  with  academic  engineering 
backgrounds  affected  differently  by  color  than  decision 
makers  with  nonengineering  backgrounds?  Those  decision 
makers  with  an  artistic,  archi tecture,  educational  etc 
background  may  be  stimulated  differently  by  color 
displays.  Research  to  date  is  void  In  this  area. 

7.  This  research  did  not  follow  up  on  subjects  to 
test  memory  over  an  extended  period.  Often  managers  make 
decisions  based  on  Information  which  they  reviewed  days  or 
weeks  earlier.  Does  color  affect  memory  over  an  extended 
period  of  time? 

8.  Not  all  problems  faced  by  decision  makers  using  a 
DSS  fall  Into  the  general  class  of  problems  represented  by 
the  inventory  control  problem  whose  data  were  or  were 
considered  deterministic.  Does  color  relay  information 
differently  for  different  types  of  problems?  The 
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experimental  literature  reports  most  decision  makers 
remove  the  stochastic  nature  from  their  data  prior  to 
their  use.  Does  information  processed  in  color  affect 
this  process? 

9.  Research  on  the  effects  of  ambient  lighting  when 
using  the  CRT,  with  recommendations  on  its  levels  has  been 
reported  in  the  literature.  However,  these  studies  have 
been  mainly  directed  to  CRT  visual  displays  supporting 
text,  with  emphasis  on  monochrome  displays.  Do  the 
recommendations  for  the  ambient  lighting  remain  the  same 
for  CRT  displays  supporting  color  graphics  displays? 

10.  Vertical  bar  mixed  graphics  displays  as  used  in 
this  research  are  one  of  the  most  widely  forms  of  displays 
used.  However,  they  are  not  the  only  type.  Does  color, 
when  functionally  added  to  other  types  of  graphics 
displays  such  as  line  graphs,  pie  charts,  horizontal  bar 
charts,  etc  affect  the  performance  differently  than  the 
monochrome  patterns? 

11.  Often  information  on  which  decisions  are  based 

Is  presented  in  a  meeting  environment  with  the  interface 
being  viewgraphs  or  slides,  usually  read  at  a  distance. 
As  a  result  of  the  technology,  the  viewgraphs  and/or 
slides  are  Images  of  the  CRT  displays.  How  does  color, 
when  1 ncorporated  in  this  type  of  dec  1 s 1  on  maker ' s 


interface,  affect  the  performance  of  a  decision  maker  in 
the  short  and  long  term? 


12.  Theoretically,  color  should  enhance  performance. 


The  prob 1 em  may 

lie  in 

the 

fact 

it 

has  not  yet 

been 

discovered  how  or 

what 

colors 

to 

use 

that  will 
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of 

how 

the 

bra  i  n 

processes 

the 

information  code.  What  colors?  How  many  colors?  How 
much  of  each  color?  are  questions  which  have  not  been 
rigorously  addressed  and  remain  unanswered. 

Conclusions 

There  continues  to  be  a  large  gap  of  quantitative 
information  regarding  how  to  integrate  color  in  a  DSS  so 
it  benefits  the  user.  This  reported  study  attempted  to 
narrow  this  gap  of  knowledge.  Research  efforts  like  this 
and  those  suggested  above  may  lead  to  discovering  a  way  of 
allowing  color  to  enhance  the  effectiveness  of  a  DSS. 
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BACKGROUND  AND  EXPERIENCE 


1.  Enter  your  student  ID  number: 

2.  How  long  have  you  been  a  university  student? 

A.  Less  than  3  months  E.  3  years  (36-47  months) 

B.  5-11  months  F.  4  years  (48-59  months) 

C.  1  year  (12-23  months)  G.  3  years  (60-71  months) 

D.  2  years  (24-35  months)  H.  6  years  or  more 


3.  What  is  your  highest  level  of  education  now? 

A.  GED  certificate  or  high  school  equivalency 

B.  High  school  graduate 

C.  One  or  two  years  of  college  or  vocational  school  (include 
Associate  Degree) 

D.  More  than  two  years  of  college 

E.  College  degree  (BA,  BS,  or  equivalent) 

F.  Graduate  study  but  no  graduate  degree 

G.  Master’s  degree 

H.  Doctor’s  degree  (Ph.D. ,  M.D.,  LL.B. ,  Ed.D.,  etc.) 


4.  In  what  college  are  you  enrolled? 

A.  College  of  Engineering  and  Applied  Sciences 

B.  College  of  Architecture 

C.  College  of  Liberal  Arts 

D.  College  of  Business  Administration 

E.  College  of  Education 

F.  College  of  Fine  Arts 

G.  College  of  Law 

H.  College  of  Nursing 

I.  College  of  Public  Programs 

J.  School  of  Social  Work 

K.  Other 


5.  What  is  your  age  in  years? 


6.  Are  you  male  or  female? 

A.  Male 
8.  Female 


7.  Hava  you  aver  worked  on  a  computer  previous  to  today? 


A.  Yes 

B.  No 


3.  How  do  you  -feel  about  working  with  the  computer? 

A.  Very  comfortable 

B.  Comfortable 

C.  Hesitant 

0.  Uncomf ortable 
E.  Very  uncomfortable 


9.  Have  you  ever  played  video  games? 

A.  Yes 

B.  No 


10.  How  often  did  you  usually  play  video  games  in  the  last  six 
months? 

A.  Never  play 

B.  0-3  hours/week 

C.  3-6  hours/week 

D.  6-9  hours/week 

E.  10  or  more  hours/week 


11.  Are  the  video  games  you  play  presented  in 

A.  Breen  and  black 

B.  Multicolor 

C.  Black  and  white 

D.  Other 

E.  1  have  never  played  video  games 


12.  How  often  did  you  usually  interact  with  the  computer  using  a 
CRT  screen  display  for  non-video  game  activities  in  the  last  six 
months? 

A.  Never 

B.  0-3  hours/week 

C.  3-6  hours/week 

D.  6-9  hours/week 

E.  10  or  more  hours/week 


Appendix  B 
Case  Study 


PROBLEM 


Facts 

You  are  the  Sales  Manager  -for  the  U-Sell-Um  Bike 
Shop.  Your  store  is  located  near  the  University.  It  is 
time  to  order  your  bike  stock  for  the  summer  season.  It 
is  your  job  to  determine  what  type  of  bike  and  how  many 
you  should  buy  in  order  to  maximise  profits  for  the 
U-Sell-Um  Bike  Shop.  There  are  several  facts  that  may 
affect  your  final  decisions. 

1.  There  are  three  models  of  bikes  on  the  market: 

1.  Campus  Bikes 

2.  3-Speed  Bikes 

3.  Deluxe  Bikes 

2.  The  selling  price,  purchase,  storage,  understock 
(shortage)  and  overstock  (overage)  costs  are  known  for 
each  model . 

3.  Bikes  can  be  purchased  from  their  respective 
manufacturers  only  in  multiples  of  20  bikes. 

4.  Because  of  advertising,  literature,  sales 
expertise  and  special  bike  rack  displays,  only  one  model 
of  bike  can  be  stocked  in  your  store  during  a  season. 

5.  Presently  you  have  room  to  display  a  maximum  of 
30  bikes.  A  friend  has  a  warehouse  that  has  room  to  store 
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up  to  10  more  bikes  free  of  charge.  However  if  you  have 
more  than  40  bikes  in  your  inventory,  you  will  have  to 
rent  space  for  all  bikes  you  do  not  have  room  for  in  your 
shop  display.  The  rental  fee  is  $5/bi ke/season  no  matter 
how  long  it  is  stored. 

6.  Unless  the  environment  changes  (customers  or 
competition)  the  demand  for  all  styles  of  bike  for  any 
season  is  33. 

7.  Presently  your  shop  is  sold  out  of  bikes  and  does 
not  have  any  unfilled  orders. 

As  a  business  person,  your  goal  is  to  maximize  your 
profits  when  considering  the  above  information. 


Go  on  to  the  next  page. 
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Instructions 

Revenue  and  cost  in-formation  concerning  the  three 
bike  models  is  given  on  computer  generated  graphics  sheets 
located  in  your  packet.  This  in-formation  is  in  bargraph 
format.  Each  model’s  figures  are  shown  on  a  separate 
screen.  You  have  the  option  of  repeatedly  reviewing  any 
of  the  models’  bargraphs  during  each  case  study. 

The  format  on  the  graphics  sheets  for  every  model’s 
information  is  the  same.  The  title  indicates  the  season, 
model  and  demand.  Beneath  the  title  is  the  legend  which 
gives  the  per  bike  figures  and  the  key  to  reading  the 
charts.  Linder  the  legend  are  two  charts.  The  bargraph 
chart  on  the  left  represents  the  breakdown  of  revenues  and 
costs  when  buying  two  different  quantities  of  bikes  (in 
multiples  of  20).  The  first  set  of  bars  represents 
stocking  fewer  than  the  demand  (understocki ng )  while  the 
second  set  represents  stocking  more  than  the  demand 
(overstocking).  The  right  bargraph  chart  sums  the  left 
chart’s  revenues  and  costs  into  total  profit  or  loss 
figures  for  each  of  the  two  stocking  quantities.  (See 
f i gure  bel ow. ) 
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regional  manager,  Ms.  Bucks.  At  that  time  you  will  not 
have  the  graphical  information  available  for  review. 

If  you  have  any  questions,  please  ask  the 
Exper i menter  now.  After  any  questions  have  been  answered, 
review  the  Problem  Facts  and  wait  for  the  Experimenter  to 
tell  you  to  begin. 
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CASE  l:  SUMMER  POLICY  PURChttSE  DECISIONS 


1.  Based  an  the  currant  aalaa  inf or nation,  which  type  of  bike  nodal 

will  you  stock  to  maximize  your  profits? 

A.  Campus  Bike  Modal 

B.  3-Speed  Bike  Model 

C.  Deluxe  Bike  Model 

2.  How  many  of  that  type  would  you  order  for  the  summer? 

A.  20 

B.  40 

3.  If  for  some  reason  you  stocked  Campus  Bikes*  what  is  your 

estimate  of  your  total  profits? 

A.  S210 

B.  *340 

C.  *300 

D.  S82S 

4.  Which  of  the  following  pairs  represents  the  minimum  and  maximum 

dollar  figures  associated  with  overstocking? 

A.  (200,  600) 

B.  (  90,  320) 

C.  <  70,  360) 

D.  < 130,  360) 

E.  (200,1200) 

3.  Based  on  the  graphical  information  which  rule  of  thumb  best 
describes  the  decision  on  the  purchase  of  any  of  the  bike 
mqdels? 

A.  Purchase  less  then  the  demand  when  shortage  costs  are 

less  than  overstocking  costs. 

B.  Smaller  costs  guarantee  more  profits. 

C.  More  inventory  is  always  more  profitable. 

D.  None  of  the  above  statements  are  appropriate. 

6.  Warehouse  eost  associated  with  stocking  the  3-Speed  Bikes  is  a 
maximum  of 

A.  •  70. 

B.  «  260. 

C.  »  600. 

0.  *1200. 

E.  None  as  the  warehouse  is  not  used 


7.  If  you  could  cut  costs  somew here  by  10%  despite  tha  model  of 

bike,  what  coat  would  ba  most  profitable  to  cut? 

A.  Purchase  Cost 
S.  Shortage  Cost 

C.  Overstock  Cost 

D.  Cutting  any  of  tha  above  costs  by  107.  would  have  the 

same  affect. 

8.  If  advertising  would  increase  the  demand  from  33  to  37  Deluxe 

Bikes,  what  is  the  MAXIMUM  you  would  be  willing  to  spend  on 
advertising? 

A.  None  as  advertising  would  never  be  profitable 

B.  S300 

C.  S400 

D.  SSOO 

E.  *600 

9.  If  you  understock  the  Deluxe  Bike,  what  are  your  expected 

shortage  costs? 

A.  *  320 

B.  *  600 

C.  *1200 

D.  None  as  no  shortages  are  expected 


You  are  finished  with  this  analysis.  Please  raise  your  hand  to 
indicate  to  the  Experimenter  you  are  ready  for  the  next  case  study. 
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SUMMER:  DEMAND  33  3-SPEED  BIKES 


Q  TOTAL  SALES  PRICE  RT  *  50/BIKE 

9  TOTAL  PURCHASE  COST  AT  $  30/BIKE 
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SUMMER:  DEMAND  33  DELUXE  BIKES 
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CASE  2  SCENARIO:  FALL  INVENTORY 


Your  manuf acturers’  representati ves  have  contacted  you 
and  are  requiring  you  to  order  now  for  both  your  Summer 
shipment  and  Fall  shipment.  The  Fall  demand  will  be 
affected  by  the  University’s  new  parking  policy.  Parking 
fees  will  be  higher  and  the  close-in  parking  lots  will  be 
reserved  for  faculty,  staff  and  visitors  only.  The 
following  facts  should  be  considered  when  determining  the 
Fall  purchase  order  for  bikes. 

1.  The  demand  for  the  Fall  will  change  from  that  of 
the  summer  due  to  the  University’s  new  parking  policy. 
University  students  prefer  the  cheaper  Campus  bike. 


will  store  up  to  10  more  for  you  in  his  warehouse  for 


free.  However  if  you  have  more  than  40  in  your  inventory, 


you  will  have  to  rent  space  for  ALL  bikes  you  do  not  have 


room 


for  in  your  shop  display 


The  rental  fee  is 
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$5/bi ke/season  no  matter  how  long  it  is  stored. 

Initially  you  are  interested  only  in  the  Fall  figures 
that  reflect  the  change  in  demand  due  to  the  University's 
new  parking  policy.  The  Summer  inventory  figures  are  not 
of  concern.  The  revenue/cost  and  profit  bargraphs  have 
been  updated  to  reflect  the  Fall  demand  options.  The 
three  presentations  show  the  Fall  options  without 
considering  the  impact  of  the  Summer  purchase.  Using 
these  graphs  answer  Case  2  Purchase  Decision  Questions. 
Review  the  questions  before  you  start  the  analysis. 
Remember  all  questions  are  based  on  the  information  given 
on  the  graphics  sheets. 
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CASE  2:  FALL  POLICY  PURCHASE  DECISIONS 
SUMMER  INVENTORY  NOT  CONSIDERED 


1.  What  inventory  purchase  would  maximize  your  profits  for  the 

Fall? 

A.  Campus/60 

B.  Campus/80 

C.  3-Speed/ 20 

D.  3-Speed/40 

E.  Deluxe/20 

F.  Deluxe/40 

2.  Did  the  change  of  demand  of  the  different  models  due  to  the 

University’s  new  parking  policy  affect  the  model  of  bike  you 

will  stock  in  the  Fall? 

A.  Yes,  the  change  in  demand  changes  the  model  I  will  stock 

B.  No,  the  change  in  demand  makes  the  model  I  planned  for 

the  Summer  even  more  profitable 

3.  What  is  the  MAXIMUM  you  would  pay  to  expand  your  display  area  if 

the  future  demand  remains  the  same  as  in  the  Fall? 

A.  S  0.  No  extra  display  space  is  needed 

B.  • 1 50 . 

C.  *230. 

4.  How  do  the  Fall  profits  compare  to  the  Summer  profits?  (Answer 

each  question  T  for  True  or  F  for  False.) 

T  F  Understocking  remains  the  rule  of  thumb  as  it  is 

more  profitable  than  overstocking  for  all  models. 

T  F  The  minimum  inventory  purchase  considered  during 

both  seasons  was  20  bikes  which  always  guaranteed 
profits. 

T  F  Warehouse  costs  do  not  affect  the  total  profits  of 

the  most  profitable  model  for  either  season. 

T  F  If  your  inventory  policy  was  chosen  by  Lady  Luck 

you  have  a  chance  of  going  in  debt  during  each  season. 

T  F  It  would  be  worth  spending  SlOO/season  on 

advertising  if  it  guaranteed  the  depletion  of  your 
stock. 

You  are  finished  with  this  analysis.  Please  go  on  to  the  next  case 

study. 
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FALL  FIGURES:  DEMAND  67  CAMPUS  BIKES 
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FALL  FIGURES:  DEMAND  27  3-SPEED  BIKES 
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FALL  FIGURES:  DEMAND  25  DELUXE  BIKES 
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CASE  3  SCENARIO:  LONS  RANGE  PLANNING 

As  a  practical  business  person  you  realize  that  the 
summer’s  inventory  and  sales  may  have  a  significant  impact 
an  you r  Fall  purchasing  decisions.  Such  things  as  summer 
shortages,  overstocking,  warehouse  costs  and  costs  of 
changing  models  of  bikes  may  have  an  effect  on  what  bike 
model  becomes  most  profitable  to  stock  in  the  Fall.  After 
looking  into  these  particular  costs,  you  have  found  out 
the  following  facts: 

1.  Any  shortages  of  a  particular  stocked  model  from 
the  Summer  will  add  to  the  Fall  demand  if  the  model  is  not 
changed  between  seasons. 

2.  Any  overstocking  of  a  particular  model  from  the 
Summer  will  add  to  the  Fall  inventory  if  the  model  is  not 
changed  between  seasons. 

3.  If  the  model  is  changed  between  seasons,  any 
shortages  or  overstocking  will  not  affect  the  Fall 
figures.  Shortages  are  lost  business  and  overstocked 
bikes  will  be  donated  to  local  charity. 

4.  Changing  models  between  seasons  costs  $150.  This 
expense  covers  the  change  in  customer  literature, 
advertising,  display  racks,  etc. 

5.  A  friend's  warehouse  is  still  available  for 


storing  10  bikes  free.  However  if  the  total  inventory  is 
over  40,  you  have  to  contract  for  storage  space  for  all 
bikes  that  can  not  be  displayed  in  your  store.  The 
storage  fee  is  still  *5/bike. 

The  revenue/costs  and  profits  charts  have  been 
updated  to  reflect  the  impact  any  summer  inventory 
purchases  have  on  the  Fall  figures.  Using  these  graphs, 
answer  Case  3  Purchasing  Decision  questions. 
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CASE  3:  LONG  RANGE  PLANNING 


IMPACT  OF  SUMMER  INVENTORY  ON  FALL  PURCHASES 


1.  Whan  considering  the  Summer  purchase  inventory,  does  it  appear 
profitable  to  change  models  from  Summer  to  Fall? 

A.  Yes 

B.  No 


When  considering  the  impact  of  the  Summer  inventory,  what 
inventory  purchase  would  maximize  your  profits  in  the  Fall? 

A.  Campus/60 

B.  Campus/SO 

C.  3-Speed/ 20 

D.  3-Speed/ 40 

E.  Deluxe/0 

F.  Deluxe/ 20 

G.  Deluxe/40 


When  considering  the  Summer  inventory,  does  the  University’s  na 
policy  have  an  effect  on  your  Fall  purchase  decision? 

A.  Yes 

B.  No 


4.  When  considering  the  Summer  inventory  in  your  Fall  decision,  how 
much  is  your  friend’s  warehouse  offer  worth  to  you  in  dollars 
saved? 

A.  SO  His  warehouse  is  never  used 

B.  SIS 

C.  SSO 

D.  over  S63 


J 
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S.  The  Campus  modal  manuf acturer ’ s  representati va  promisas  that  her 
company  Mill  pay  all  change-over  expanses  <*130 )  necessary  to 
insure  your  shop  Mill  stock  Campus  bikes  in  tha  Fall.  Will  her 
offer  affect  your  purchase  decision? 

A.  No  Campus  Bikes  Mas  the  model  originally  planned  for 

purchase 

B.  No  Campus  Bikes  still  would  not  be  the  most  profitable 

to  stock 

C.  Yes  Eliminating  the  change-over  cost  for  Campus  Bikes 

makes  that  model  most  profitable 


6.  Your  regional  manager,  Ms.  Bucks,  just  called  and  Mants  you  to 
stock  tha  Deluxe  modal  for  both  the  Summer  and  Fall.  Would  it 
be  profitable  for  you  to  invest  *30  in  advertising  to  insure 
all  the  Deluxe  bikes  are  sold  by  the  end  of  tha  Fall  season? 

A.  No  The  mast  profitable  inventory  purchase  has 

shortages  so  advertising  Mould  be  an  added  expense 

B.  No  The  advertising  cost  would  NOT  save  over  *50  in 

overstocking  casts 

C.  Yes  The  advertising  cost  would  save  over  SSO  in 

overstocking  costs 


7.  When  considering  the  Summer  inventory  are  you  guaranteed  to  make 
a  profit  no  mattar  what  Fall  purchase  policy  you  chose? 

A.  Yes 

B.  No 


You  are  finished  with  the  interactive  decision  making  session. 
Please  raise  your  hand  to  indicate  to  the  Experimenter  you  are  ready 
for  the  final  questionnaire. 
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FALL  FIGURES:  DEMAND  67  CAMPUS  BIKES 
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FALL  FIGURES: 


DEMAND  67  CAMPUS  BIKES 


SUMMER  PURCHASE  40  CAMPUS  BIKES 
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FALL  FIGURES:  DEMAND  27  3-SPEED  BIKES 

SUMMER  PURCHRSE  NO  3-SPEED  BIKES 
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FALL  FIGURES:  DEMAND  27  3-SPEED  BIKES 

SUMMER  PURCHASE  40  3-SPEED  BIKES 

B  TOTAL  SRLES  PRICE  AT  *  50 /BIKE 

9  TOTRL  PURCHASE  COST  RT  *  30/BIKE 

B  TOTAL  SHORTAGE  COST  AT  *  20/BIKE 

DO  TOTAL  OVERSTOCK  COST  AT  *  10/BIKE 

M  TOTAL  WAREHOUSE  COST  AT  *  5 /BIKE  OVER  40 

S  TOTAL  CHANGE  MODEL  COST  *150 

*2100  - 

1800 
1500 
1200 
900 
600 
300 
*  0 
300 
600 
900 
*1200 


I 


MHMOn  PICZPKPUJ 


FALL  FIGURES:  DEMAND  25  DELUXE  BIKES 
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MEETING  WITH  MS.  BUCKS 


The  U-Sel 1 -Urn  regional  manager,  Me.  Bucks,  has  stopped  by  the 
shop  to  see  how  business  is  going.  She  is  interested  in  your  future 
plans.  She  has  contacted  the  manuf actureri  and  has  some  information 
concerning  possible  contracts  with  them.  She  wishes  to  discuss  the 
information  with  you  before  you  send  in  your  purchase  order. 


1.  When  only  the  summer  inventory  was  considered,  what  model 

appeared  the  most  profitable  to  stock  for  the  Summer? 

A.  Campus  bikes 

B.  3-Speed  Bikes 

C.  Deluxe  Bikes 

2.  When  taking  into  account  only  the  summer  inventory  and  demand, 

what  model  appeared  to  be  the  most  profitable  to  stock  for  the 

Fall? 

A.  Campus  Bikes 

B.  3-Speed  Bikes 

C.  Deluxe  Bikes 

3.  Ms.  Bucks  reminds  you  that  your  ultimate  goal  is  to  maximize  ' 

total  profits  over  both  seasons.  Which  of  the  following 

describes  your  purchase  decisions  for  the  next  two  seasons? 

(Circle  T  for  True,  F  for  False) 

T  F  Maximizing  each  season's  profits  maximizes  total 
profits, 

T  F  Warehouse  costs  are  the  significant  cost  factor  in 
determing  the  most  profitable  model (s)  to  stock  for  the 
next  two  seasons. 

T  F  Warehouse  costs  are  the  significant  cost  factor  in 
determining  the  most  profitable  quantities  to  stock  for 
the  next  two  seasons. 

T  F  When  total  supply  of  bikes  is  the  same  as  total 

demand,  maximum  profits  are  guaranteed  over  the  next  -  « 

two  seasons  if  overstocking  is  the  summer’s  pol  Icy. 
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4.  How  much  pro-fit  do  you  expect  to  make  ovar  tha  naxt  two  lauont? 

A.  Approx Imataly  *  800 

B.  Approx imataly  41000 

C.  Approx imataly  41200 

D.  Approx imataly  41400 

5.  What  modal  of  bike  do  you  plan  to  purchase  for  your  Summer 

inventory  in  order  to  maximize  your  total  profits? 

A.  Campus  Bikas 

B.  3-Speed  Bikas 

C.  Deluxe  Bikas 

6.  What  model  of  bike  do  you  plan  to  purchase  for  your  Fall 

inventory  in  order  to  maximize  your  total  profits? 

A.  Campus  Bikas 

B.  3-Speed  Bikas 

C.  Deluxe  Bikas 

7.  Is  your  friend’s  warahousa  offer  worth  enough  (over  40)  to  you 

to  mention  it  to  Ms.  Bucks? 

A.  Yes 

B.  No 

8.  What  was  the  overage  cost/bika  associated  with  the  3-Speed 

modal? 

A.  4  7/bike 

B.  410/bike 

C.  420/bi ka 

D.  430/bike 

E.  480/bike 

9.  What  was  the  shortage  cost/bika  associated  with  the  Deluxe 

model? 

A.  4  3/bike 

B.  410/bike 

C.  420/bike 

D.  440/bike 

E.  480/bike 

10.  The  dollar  values  printed  on  the  scale  of  the  bargraphs  in  each 
chart  were  in  multiples  of 

A.  4100 

B.  4200 

C.  4300 

D.  4300 


11.  Ms.  Bucks  forecasts  a  nsw  11  ns  of  bike  cosing  lats  Summer  or 
early  Fall.  Because  she  is  uncertain  when  they  will  became 
available  on  the  open  market,  she  wants  to  be  sure  you  have 
room  for  thee  as  she  feels  they  will  be  the  hottest  bike  ever. 
Therefore  she  insists  you  understock  both  seasons.  If  you  do, 
will  you  be  guaranteed  profits  no  matter  what  model (s)  you 
stock? 


A.  Yes  Understocking  is  the  rule  of  thumb  for  the  next  two 

seasons 

B.  Yes  Understocking  maximizes  profits  as  long  as  models 

are  not  changed  between  seasons 

C. '  No  Shortages  can  turn  profits  into  losses  even  when 

the  same  model  is  stacked  during  both  seasons 


Ms.  Bucks  has  to  leave  for  another  store.  She  thanks  you  for  your 
information  and  wishes  you  the  best  of  luck  over  the  next  two 
seasons. 


You  are  finished  with  this  experiment.  Thank  you  for  your  generous 
cooperation. 


Appendix  C 

Fortran/D  I  3000  Program 
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making  environment.  The  information  needed  is  presented 
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while  completing  part  of  the  case  study.  No  previous 
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Effects  of  Alternative  Chromatics  Graphic  Displays 
in  Decision  Support  Systems 


ABSTRACT 

This  article  describes  a  study  that  investigated  the  relative 
effectiveness  on  decision  makers'  performance  when  integrating  color 
functionally  into  mixed  text  and  graphics  displays  of  Decision  Support  Systems. 
Five  alternatives  of  color,  7  characteristics  of  decision  makers,  3  levels  of 
decision  time  and  6  levels  of  decision  quality  were  the  variables  considered  in 
the  study.  Sixty-three  hypotheses  were  tested  using  the  data  collected  from 
120  subjects  who  had  academic  engineering  backgrounds.  The  data  contained 
insufficient  evidence  at  the  .05  significance  level  to  conclude  that  color  has 
a  unique  quality  for  functionally  coding  information  in  a  DSS.  If  color  is 
incorporated  into  a  DSS,  those  colors  currently  preset  on  many  CRTs  appear 
best,  but  increased  efficiency  of  decision  makers  can  not  be  assumed. 

KEYWORDS:  computer  graphics,  MIS,  color  graphics,  decision  quality,  decision 
time,  information  recall. 


Effects  of  Alternative  Chromatic  Graphic  Displays 
in  Decision  Support  Systems 

Introduction 

Computer  generated  graphics  supporting  the  option  of  color  have  become  the 
heart  of  many  decision  support  systems  (DSS)  in  the  industrial,  business  and 
military  sectors.  Technological  advances,  the  intuitive  appeal  of  color  output 
and  the  increasing  availability  and  usability  of  hardware/ sof t war e  have 
resulted  in  a  steady  growth  in  the  application  of  color  to  information  systems. 
However,  these  systems: 

have  been  growing  seemingly  in  an  ad  hoc  manner. 

While  support  and  praise  of  these  systems  have 
been  relatively  high,  it  appears  that  efforts  have 
fallen  short  of  realizing  maximum  effectiveness 
and  efficiency  for  the  system  designed  (Dickson, 

Senn  and  Chervany,  1977,  p.  913). 

The  technical  development  of  the  color  graphical  displays  is  ahead  of  reported, 
quantitative  information  regarding  its  contribution  to  the  DSS.  The  challenge 
remains  to  ascertain  its  relative  effectiveness  on  user  performance. 


Background 

For  over  25  years,  color  has  been  used  as  a  method  of  information  input 
coding.  However,  empirical  evidence  and  guidelines  on  how  to  use  it 
effectively  are  sparse  and  very  narrow  in  scope.  In  fact,  the  literature  has 
Just  begun  to  address  how  color  impacts  computerized  decision  support  systems. 
The  theoretical  and  the  majority  of  the  subjective  literature  are  concurrent  in 
their  reports:  color  allows  for  relatively  more  efficient  processing  of 
information,  thus  improves  performance  over  monochrome  displays  (De  Mars,  1975; 
Friend,  1980;  Miller,  1982;  Morris,  1979;  Truckenbroad,  1981). 

The  majority  of  the  empirical  studies  based  their  conclusions  on  analysis 
of  performance  of  the  simple  discrete  tasks  of  search,  locate,  identify,  and 


count.  The  environments  were  typically  isolated  laboratory  settings  that  often 
simulated  an  aircraft  cockpit.  The  alternative  codes  to  which  color  was 
•compared  for  relative  effectiveness  included  letters,  numerals,  geometric  and 
abstract  shapes.  The  subjects,  whose  performance  constituted  the  data  for 
analysis,  were  usually  lacking  in  experience  with  the  given  task.  Without 
exception  the  measure  of  performance  was  speed  and/or  accuracy.  Findings  in 
these  experimental  literature  are  not  consistent  with  the  theoretical  or  the 
majority  of  subjective  reports.  The  empirical  results  can  be  summarized  by 
stating  the  relative  effectiveness  of  color  is  a  function  of  the  task 
undertaken.  Generally,  color  was  not  found  to  have  a  unique  quality  for  coding 
information  in  a  display  over  other  codes  (Barker  and  Krebs,  1977;  Christ, 
1977;  Christner  and  Ray,  1961;  Cook,  197U;  Tullis,  1981). 

There  continues  to  remain  glaring  gaps  in  the  literature  on  the  use  of 
color  in  visual  displays,  especially  those  which  do  not  involve  search,  count, 
locate  and  identify  type  of  tasks.  One  of  the  fastest  growing  sectors  of 
computer  graphic  users  are  managers  and  decision  makers  who  are  incorporating 
color  graphics  into  their  DSS  (Berk,  Brownston  and  Kaufman,  1982;  McLamb, 
1982).  The  question  is,  will  color  aid  managers  in  their  primary  function  of 
decision  making.  The  subjective  literature  purports  that  color  will  speed  up 
the  decision  making  process  significantly  as  well  as  improve  recall  or  memory 
(Durrett,  Zwiener  and  Freund,  1981;  Friend,  1980;  Miller,  1982;  Morris,  1979). 
However,  the  question  of  how  color  affects  speed  and  accuracy,  two  of  the  most 
coveted  resources  of  decision  makers,  has  not  been  thoroughly  addressed. 
Dooley  and  Harkins  (1970)  investigated  the  effect  color  had  on  attention  and 
memory,  when  incorporated  into  a  graphic  display  that  sat  in  the  corner  of  the 
experiment  room.  The  subjects  were  not  aware  the  graphic  display  was  a  part  of 
the  experiment.  The  study  presented  insufficient  evidence  to  conclude  color 


affected  memory  performance.  Tullis  (1981)  had  subjects  perform  a  training  and 
diagnostic  task  involving  the  status  of  telephone  lines.  The  graphics  were 
line  schematics  displayed  on  a  cathode  ray  tube  (CRT)  interface.  Conditions  of 
the  telephone  lines  were  coded  by  either  an  achromatic  shape  or  chromatic 
underlining  bar  (red,  yellow,  green).  The  task  involved  decision  making  but 
not  memory.  He  concluded  there  was  no  significant  differences  in  time  or 
accuracy  performance  between  the  color  underlining  bar  and  the  achromatic 
shapes . 

There  still  remains  much  to  be  learned  regarding  when  and  how  color  can  be 
used  to  increase  a  decision  maker’s  performance.  The  purpose  of  this  study  was 
to  investigate  the  relative  effectiveness  on  decision  maker’s  performance  when 
integrating  color  text  and  graphic  displays  into  typical  decision  support 
systems  (DSS). 


Conceptual  Framework 

The  DSS  investigated  in  this  study  was  composed  of  decision  makers  with 
academic  engineering  backgrounds.  They  solved  a  basic  engineering  case  study 
which  Involved  structured  problems  and  deterministic  data.  The  data  were 
presented  in  mixed  text  and  graphic  displays  as  exemplified  in  Figure  1. 
Computer  generated  hardcopy  was  used  to  allow  for  a  wide  variety  of  colors. 
The  decisional  setting  was  one  of  managerial  control  of  resources.  The 
variables  considered  were  classified  as  either  treatment  factors, 
characteristic  factors  or  performance  variables  as  delineated  in  Table  1. 

The  treatment  factors  included  five  alternative  sets  of  colors.  Each  set 
was  used  to  code  the  legend  squares  and  column  bars  in  each  of  the  graphic 
displays.  Traditional  uses  of  color  were  utilized,  that  is  green  represented  a 
revenue  while  red  coded  a  cost.  Colors  of  the  same  hue  were  not  placed 
adjacent  to  each  other.  Color  usage  was  compatable  across  all  displays  as  a 
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Table  1 


VARIABLES  for  ANALYSIS 


Treatment  Factor 

Color 

I CRT  Color  (CRT) 

Black  &  White  (B/W) 

Outline  (OUT) 

Random  (RND) 

Pastel  (PAS) 

Characteristic  Factor 

Age  (years) 

”  1 17-22 
23*28 
29* 

Sex 

I  Female 
|Male 

Years  of  College 
<  3 

>  3  and  <  5 

>  5 

Education  Level 
Fresh® an/Sophomore 
Junior/Senior 
Graduate 

CRT  Experience 
Not  a  User  (0-3  Never  Use) 

Moderate  User  (0-6  hrs/wk) 

Frequent  User  (>  6  hrs/wk) 

Attitude  toward  Computer  Use 
Uncomfortable 
Comfortable 
Very  Comfortable 

Video  Game  Experience 
I  Do  Not  or  Seldom  Play  (0-3  hrs/wk) 
Moderate  Player  (>  3  hrs/wk) 


Sample  Size 


Sample  Size 


Table  l  {cont.) 


VARIABLES  for  ANALYSIS 

Performance  Variables  Sample  Size 


TIME  (Seconds) 


Total  Time  120 

No  Memory  Time  120 

Memory  T-ime  1  20 

QUALITY  SCORE  (*) 


Overall  Score  1  20 
No  Memory  Score  1 20 
Combined  Memory  Score  120 
Memory  Only  Score  120 
Read  Only  Score  1  20 
Extrapolate  Information  Score  120 


particular  color  always  symbolized  the  same  information.  The  set  of  colors 
designated  CRT  were  the  primary  colors  preset  on  the  majority  of  color  CRTs. 
These  included  green,  red,  yellow,  blue,  turquoise,  cyan,  black,  and  white. 
The  second  set  designated  OUT  was  the  same  as  the  CRT  except  the  legend  squares 
and  columns  were  only  outlined  in  these  colors,  not  solidly  filled.  This 
treatment  was  selected  to  see  if  the  amount  of  color  in  the  output  had  any 
affect.  The  third  set,  RND,  consisted  of  a  random  selection  of  colors  that  are 
available,  either  primary  or  user  specified,  on  the  CRT  colorwheel.  This 
treatment  was  selected  to  investigate  whether  it  was  the  typical  stereotype 
colors  CRT  or  Just  the  presence  of  any  color  that  had  any  affect  on 
performance.  This  alternative  included  green,  hot  pink,  white,  turquoise, 
purple,  orange,  yellow  and  black.  The  fourth  treatment,  noted  as  PAS,  used  a 
random  selection  of  pastel  colors.  The  intensity  of  the  color  was 
significantly  less  than  that  in  the  CRT  or  RND  alternative.  The  afreet  of 
color  Intensity  was  the  test  objective  of  this  set  which  was  made  up  of  lime 
green,  pink,  yellow,  white,  purple,  grey,  blue  and  flesh.  The  monoerhome 
treatment,  designated  B/W,  was  used  to  investigate  the  most  fundamental  issue 
of  whether  the  use  of  chromatic  color  in  any  of  the  tested  configurations  had 
an  affect  on  decision  making.  For  this  alternative,  eight  of  the  patterns 
available  on  the  DI-3000  graphics  language  package  were  used  to  generate  the 
displays.  The  eight  patterns  selected  were  based  on  the  criterion  of  non¬ 
similarity  in  pattern. 

Seven  characteristics  of  decision  makers  that  may  have  influenced 
performance  were  included  in  the  design.  These  were  age,  sex,  years  of 
college,  education  level,  CRT  experience,  attitude  toward  the  computer  and 
experience  interfacing  with  computers  or  video  games.  The  specific  levels  and 


sample  size  of  each  are  shown  in  Table  1.  These  data  were  gathered  from  a 
questionnaire  each  subject  completed  prior  to  participating  in  the  experiment. 

The  performance  measured  were  time  and  quality  of  decision.  These  two 
variables  were  subdivided  into  9  categories  for  more  rigorous  analysis  of  the 
research  question.  As  indicated  in  table  1,  the  three  levels  of  time 
represented  the  time  taken  to  make:  1)  those  decisions  which  required  no 
memory,  2)  those  decisions,  some  of  which  required  memory  of  no  longer 
available  graphics  and  previous  decisions  and  3)  all  the  decisions  in  the 
study.  The  decision  quality  variables  measured  performance  ranging  from  the 
simplest  decisions  (direct  reading,  no  computation  and  no  memory)  to  the  most 
difficult  (computation  ,  memory  and  extraction)  as  well  as  overall  decision 
quality.  Each  level  of  decision  quality  is  defined  when  the  results  are 
discussed.  One  hundred  twenty  observations  were  collected  on  each  of  the  nine 
performance  variables. 

Method 

The  population  of  interest  is  decision  makers  who  have  an  academic 
engineering  background.  Many  of  the  subjects  had  been  or  were  presently  a 
member  of  the  engineering  workforce.  An  inventory  control  case  study  was 
selected  as  the  content  of  the  decision  process.  The  problems  were  based  on 
deterministic  facts. 

The  case  study  consisted  of  four  progressive  scenarios.  The  background 
facts  were  given  in  text  form.  The  decisions  needed  were  made  in  conjunction 
with  multiple  choice  questions.  Information  for  only  one  scenario  was 
available  at  a  time,  though  previous  facts  and  decisions  were  often  needed  to 
solve  the  currently  worked  scenario.  The  fourth  scenario  had  no  information 


graphic  displays.  The  average  time  spent  completing  the  case  study  was 
approximately  one  hour. 

The  experiment's  procedures  and  instrument  were  pretested  using  ten 
subjects.  Based  on  the  pretest,  the  procedure  was  modified  to  increase 
clarity.  One  hundred  twenty  subjects  were  selected  and  scheduled  for  the 
experiment.  The  treatments  were  completely  randomized  as  to  assignment  to  the 
subjects.  The  subjects  were  tested  for  color  deficiency  via  the  Dvorine  Psuedo 
Isochranatic  Test,  and  the  experiment  run. 

Analysis 

Hypotheses.  Hypotheses  were  formulated  that  Incorporated  each  of  the 
performance  variables  with  every  combination  of  treatment  and  characteristic 
factor.  Each  null  hypothesis  stated  that  no  difference  existed  between  the 
mean  performance  of  the  five  treatment  factors  listed  in  Table  1.  Thus, 
rejection  of  the  null  hypothesis  would  indicate  a  significant  difference 
between,  at  least,  the  worst  and  best  performance,  given  the  variability 
attributed  to  the  specified  characteristic  factor  had  been  taken  into  account. 
The  generic  null  hypothesis 

H(k)0:  u(i)  -  u(J)  for  all  i,j  i  -  CRT,RND,OUT,PAS,B/W 

J  •  CRT , RND  ,OUT , PAS , B/W 
k  -  Characteristic  Factor 

was  tested  for  the  9  alternative  measures  of  performance  and  7  characteristic 
factors  for  a  total  of  63  different  tests. 

The  analysis  of  the  collected  data  was  based  on  the  appropriate  statistics 
(F,  Brown-Forsythe  F,  Welch  W)  derived  from  the  2-factor  fixed  effect  ANOVA 
model.  Correlation  analysis  of  the  time  and  decision  quality  performance 
variables  resulted  in  insufficient  evidence  to  conclude  any  form  of  these  two 


variables  were  correlated.  The  underlying  criteria  of  independence,  normality, 
homoscedasticity  and  additivity  were  rigorously  examined  to  validate  the 
appropriate  model  and  statistic.  The  final  results  are  consolidated  in  Table 
2.  Entries  in  this  table  are  the  p  values  of  the  models  whose  performance 
variable  starts  the  row  and  whose  characteristic  factor  head  the  column. 
Interpretation  of  the  table  would  indicate,  for  example,  that  when  the 
variability  in  performance  due  to  age  was  accounted  for  one  could  conclude  that 
a  difference  exists  between  the  total  time  for  at  least  the  worst  and  best  of 
the  five  color  treatments  with  a  significance  level  greater  than  or  equal  to 
.25. 

For  the  models  which  contain  more  than  one  p  value/entry,  the  criterion  of 
additivity  could  pot  be  assumed.  These  p  values  correspond  to  those  calculated 
for  each  level  of  the  characteristic  factor  under  consideration  and  are 
reported  in  the  same  order  as  the  levels  are  listed  in  Table  1.  That  is,  for 
sex,  the  pairs  contain  the  p  values  for  female  and  male  respectively. 

The  p  value  entries  are  highlighted  for  those  models  which  the  Duncan  or 
Tukey-Kramer  statistic  Indicated  sufficient  evidence  to  reject  the  null 
hypothesis  of  equal  average  performances  across  all  five  color  treatments  at 
the  .05  significance  level.  A  detailed  discussion  of  these  highlighted  results 
follows. 

Decision  Time 

Total  Time  represented  the  time  (in  seconds)  needed  to  make  those 
decisions  where  the  basic  problem  facts  and  graphic  displays  were  available  for 
review.  In  eight  of  the  nine  models'  the  data  contained  insufficient  evidence 
to  conclude  chromatic  displays  differentially  affected  the  overall  time 
required  to  make  decisions.  For  those  decision  makers  who  had  between  three 
and  five  years  college  experience,  decision  time  was  shortest  when  the  visual 


Table  2 


Analysis  of  Main  Effect  of  Color 
p  Values 

Reject  Hypothesis  if  p  <  .05 


Factors 

Perf  orman  ce*,^«^w 

Color 

X 

Age 

Color 

X 

Sex 

Color 

X 

Col  Yr 

Color 

X 

Edu  Lv 

Color 

X 

CRT  Ex 

Color 

X 

Attit 

Color 

X 

V  Game 

TIME  (sec) 

Total 

.25 

.84 

.79 

.03* 

.50 

.71 

.55 

.32 

.70 

No  Memory 

.86 

.94 

.75 

.06 

.81 

.87 

.86 

.58 

.94 

Memory 

.39 

.24 

.15 

.39 

.40 

.12 

.44 

QUALITY  SCORE  (%) 

Overall 

.19 

.01* 

.12 

.22 

.26 

.15 

.13 

.03* 

No  Memory 

.24 

.13  . 

.13 

.11 

.06 

.61 

.43 

.21 

.06 

Combined 

•  32 

.00* 

.17 

.36 

.55 

.32 

.36 

.11 

Memory  Only 

.19 

.06 

.20 

.15 

.00* 

.38 

.10 

.05* 

Read  Only 

.53 

.20 

.06 

.38 

.92 

.26 

.25 

.22 

Extrapolate 

.64 

.28 

.23 

.18 

.11 

.21 

.20 

display  was  functionally  colored  with  the  RND  alternative.  The  B/W,  PAS,  CRT 
and  OUT  were  the  remaining  rank  ordering  as  shown  in  Table  3.  RND  outscored 
B/W  by  2.7  percent,  while  B/W  shortened  total  time  by  11.5  percent  over  CRT. 
RND  and  B/W  were  not  significantly  different  from  each  other,  nor  were  B/W, 
PAS,  CRT  and  OUT. 

No  memory  time  was  the  elapsed  seconds  when  formulating  decisions  which 
required  no  memory  but  extraction  of  Information  directly  available  in  the 
output  displays.  Results  from  analyses  were  consistent  across  every  case:  the 
use  or  lack  of  color  did  not  affect  no  memory  decision  time  across  the  five 
treatments  when  compared  at  the  .05  level  of  si&iificance. 

Memory  time  was  the  seconds  spent  when  making  decisions  which  scmetimes 
required  memory  of  past  displays  and  scmetimes  required  no  memory.  Analysis 
overwhelmingly  concluded  the  use  of  a  color  did  not  significantly  affect  memory 
time. 

Decision  Quality  Score 

Overall  Score  results  gave  sufficient  evidence  to  conclude  that  color 
Influenced  the  decision  quality  in  only  two  cases.  The  overall  decision 
quality  of  females,  as  shown  in  Table  4,  were  ranked  best  to  worst  as  CRT,  B/W, 
PAS,  RND  and  OUT.  Analysis  showed  insignificant  differences  between  the  first 
four  and  between  the  last  four  treatments.  The  CRT  treatment  yielded  1 6 . 5 
percent  increase  in  overall  score  over  the  B/W  treatment  and  a  46  percent 
increase  over  the  OUT  treatment.  These  results  did  not  hold  for  males.  When 
video  game  experience  was  accounted  for,  the  ranking  of  the  overall  scores  was 
CRT,  B/W,  RND,  OUT,  and  PAS  as  shown  in  Table  5.  The  first  four  were  not 
significantly  different  from  one  another,  nor  were  the  last  three.  The  CRT 
treatment  yielded  only  a  .7  percent  increase  in  overall  score  over  B/W  and  a  12 
percent  increase  over  PAS. 


Performance  Not  Equal 


Across  All  Treatments  at  p  <  .05 


Table  4 

ANALYSIS  OF  COLOR  for  FEMALES  vs  OVERALL  DECISION  SCORE 

Performance  Not  Equal 
Across  All  Treatments  at  p  <  .05 


^-^Color  Factor 

CRT 

B/W 

PAS 

RND 

OUT 

Perf  ormance 

BEST 

WORST 

Overall  Score  (%) 

66.6 

66.1 

61 .2 

53.3 

TUKEY-KRAJCR 

A 

A 

A 

TUKEY-KRAfCR 

GROUPINGS 


B 


B 


Table  5 


ANALYSIS  of  COLOR  x  V  GAME  EXPERIENCE  vs  OVERALL  DECISION  SCORE 

Performance  Not  Equal 
Across  All  Treatments  at  p  <  .05 


Table  6 

ANALYSIS  of  COLOR  x  FEMALE  vs  COMBINED  SCORE 

Performance  Not  Equal 
Across  All  Treatments  at  p  <  .05 


In  general,  overall  decision  quality  was  not  affected  by  the  existence  of 
color.  When  there  was  a  significant  difference,  the  rankings  were  similar  with 
chromatic  CRT  the  best  but  not  significantly  better  than  monochrome  B/W,  the 
second  best.  The  actual  average  increase  of  CRT  score  over  B/W  was  8.6 
percent. 

Mo  Memory  Scores  represented  decisions  in  which  all  information  was 
available  at  the  time  the  decisions  were  made.  The  analysis  indicated 
insufficient  evidence  to  conclude  the  color  treatment  had  a  significant  impact 
on  any  of  the  tested  circumstances. 

Combined  Memory  Score  represented  decisions  requiring  information,  some  of 
which  wap  available  at  the  time  of  the  decision  as  well  as  memory  of  previously 
presented  information.  All  models  except  female  performance  3howed 
indifference  to  the  color  treatment.  Females  performed  these  combined  memory 
decisions  best  to  worst  when  interfacing  with  CRT,  B/W,  PAS,  RND,  and  OUT 
respectively  as  shown  in  Table  6.  Analysis  indicated  Insignificant  groupings 
between  the  first  four  as  well  as  the  last  four  treatments.  However,  females 
performed  significantly  better  with  the  CRT  solid  filled  colors  than  with  the 
same  colors  in  outline  OUT  form.  Females  using  a  chromatic  CRT  display 
outscored  those  females  interfacing  with  a  monochrome  B/W  display  by  14.5 
percent  for  the  combined  memory  decisions.  This  difference  was  not 
statistically  significant. 

Memory  Only  Score  evaluated  decisions  after  all  graphic  displays  were 
shown  but  were  no  longer  available  for  reference.  These  decisions  had  to  be 
made  based  solely  on  the  memory  of  the  decision  maker.  The  analysis  indicated 
only  two  of  the  eight  models  contained  evidence  to  suggest  the  color  of  the 
graphic  display  affected  memory  only  decision  performance:  those  which 
accounted  for  the  variability  due  to  education  level  and  for  video  game 


experience.  As  indicated  in  Table  7  and  8,  the  ranking  of  treatments  was  in 
each:  CRT,  B/W,  OUT,  PAS  and  RND.  The  first  three  treatments  formed  an 
insignificantly  different  group  as  did  the  last  four.  After  the  variability  in 
performance  attributed  to  education  level  or  video  game  experience  was 
accounted  for,  chromatic  CRT  outscored  monochrome  B/W  by  approximately  4 
percent  and  7.8  percent  respectively.  This  represented  an  average  increase  of 
approximately  5.9  percent  in  memory  only  score  using  CRT  over  B/W.  However, 
these  differences  were  not  statistically  significant  at  the  .05  level.  The 
random  RND  chromatic  treatment  ranked  last,  with  an  average  of  17  percent 
decrease  in  memory  only  score. 

Read  Only  Score  represented  decisions  that  required  only  reading 
information  to  get  the  decision  directly  from  the  graphic  displays.  Neither 
memory  or  extrapolation  of  the  information  was  necessary.  None  of  the  eight 
models  Investigated  suggested  color  differentially  affected  how  the  decision 
maker  processed  the  graphical  Information. 

Extrapolation  Score  was  composed  of  the  decisions  which  required 
manipulation  or  forecasting  of  the  information  presented  in  one  or  more  graphic 
displays.  Analyses  overwhelmingly  indicated  insufficient  evidence  to  conclude 
the  colors,  hues  or  amount  of  color  differentially  affected  how  the  decision 
maker  reached  these  types  of  decisions. 

Summary 

Of  the  25  models  that  were  examined  whose  performance  was  a  function  of 
decision  time,  all  but  one  resulted  in  insufficient  evidence  to  conclude  that 
the  specific  colors,  the  hues  or  the  amount  of  color  in  a  mixed  graphic  display 
significantly  affected  decision  time  at  the  .05  level.  The  single  case  for 
which  analyses  indicated  a  difference  in  decision  time  attributed  to  color  is 
shown  in  Table  9.  The  best  chromatic  display  RND  decreased  decision  time  by 


ANALYSIS  of  COLOR  x  EDUCATION  LEVEL  vs  MEMORY  SCORE 

Performance  Not  Equal 
Across  All  Treatments  at  p  <  .05 


'"'•'•^^^Color  Factor 

CRT 

B/W 

OUT 

PAS 

RND 

Performance 

BEST 

WORST 

Memory  Score  (S) 

67. « 

62.6 

61.3 

59.6 

57.7 

ru  KEY- KRAMER 

A 

A 

A 

(SOUPING 

B 

B 

B 

B 

Table  8 

ANALYSIS  of  COLOR  X  V  GAME  EXPERIENCE  vs  MEMORY  SCORE 

Performance  Not  Equal 
Across  All  Treatments  at  p  <  .05 


Table  9 


RANKING  of  TREATMENTS 

b£ 

PERFORMANCE 

Groupings  of  Duncan/Tukey-Kramer 
a  -  .05 


Color  Ranking 

Characteristic  vs  Performance 

m 

2 

3 

4 

WORS' 

5 

TIME 

Col  Ex  vs  Total  Time 

I 

RND 

B/W 

PAS 

CRT 

OUT 

DECISION  QUALITY 

Female  vs  Overall  Score 

CRT 

B/W 

PAS 

RND 

our 

V  Game  vs  Overall  Score 

CRT 

B/W 

PAS 

OUT 

RND 

Female  vs  Combined  Mem 

CRT 

B/W 

PAS 

RND 

OUT 

Edu  Lv  vs  Memory  Only 

CRT 

B/W 

OUT 

PAS 

RND 

V  Game  vs  Memory  Only 

CRT 

B/W 

our 

PAS 

RND 

only  2.7  percent  over  the  monochrome  B/W  while  B/W  showed  a  decrease  in 
decision  time  by  11.5  percent  over  the  industry  standard  chromatic  scheme  of 
CRT.  These  results  are  inconsistent  with  the  theoretical  and  the  majority  of 
the  subjective  literature  which  contend  color  significantly  speed  the 
information  processing,  thus  decision  time. 

Of  the  42  models  that  were  examined  whose  performance  was  a  function  of 
quality  of  decision,  the  majority  37  of  them  resulted  in  insufficient  evidence 
to  conclude  the  specific  colors,  hue  or  amount  of  color  of  the  mixed  graphic 
display  signif icantly  affected  performance  at  the  .05  significance  level. 
These  results  are  inconsistent  with  the  theoretical  and  subject  literature, 
which  purport  color  when  Integrated  into  information  display  allows  for  better 
decisions.  However,  these  findings  are  concurrent  with  the  objective 
literature  which  was  summarized  by  stating  chromatic  color  does  not  appear  to 
have  a  unique  quality  for  coding  information  over  other  achromatic  codes 
(Christ  and  Corso,  1983).  For  sane  tasks  it  may  be  beneficial,  in  others  it 
may  be  detrimental.  In  this  research,  chromatic  color  did  not  influence 
performance  in  a  way  different  than  the  achromatic  code  for  a  majority  of  the 
models.  Where  it  did,  the  influence  was  always  in  the  positive  direction  as  it 
consistently  ranked  CRT  better  than  achromatic  B/W,  though  never  with  an 
interval  which  was  statistically  significant  at  the  .05  level. 

Unlike  any  other  study  reported  in  the  literature,  this  research  employed 
several  chromatic  alternatives  in  the  experimental  design.  This  was  done  to 
investigate  if  it  was  Just  the  presence  of  color  that  affected  performance  or  a 
particular  set.  As  discussed  previously,  decision  quality  performance  was  not 
affected  differently  by  any  of  the  five  alternatives.  Based  on  these  results, 
it  can  be  concluded  that  despite  the  color  set  used,  the  hue  or  the  amount  of 


color,  performance  was  not  affected  differently  than  when  a  monochrome 
intensity  pattern  was  used. 

In  the  five  models  whose  data  contained  sufficient  evidence  which 
suggested  chromatic  color  affected  decision  quality  performance,  a  pattern  was 
detected  as  displayed  in  Table  9.  In  all  five  models,  the  chromatic  CRT 
alternative  was  best,  indicating  an  average  of  approximately  8.7  percent 
increase  in  decision  quality  score  over  the  consistently  second  place 
monochrome  B/W  display.  This  difference  was  never  statistically  significant. 
Based  on  these  results,  it  is  concluded  that  if  a  color  alternative  is  chosen 
those  colors  preset  on  the  majority  of  CRTs  are  the  ones  which  result  in  the 
best  decision  quality.  However,  since  new  color  alternatives  are  becoming 
available  for  graphics  displays  and  not  all  color  alternatives  were 
investigated,  the  generalizaton  of  this  conclusion  is  limited. 

Overall  Conclusions  Generalized  to  Problem  Statement 

The  primary  objective  of  this  research  was  to  investigate  the  relative 
effectiveness  on  performance  when  functionally  integrating  color  into  a  DSS 
whose  decision  maker  has  an  academic  engineering  background  and  solves 
structured  problems  based  on  deterministic  data  presented  in  a  mixed  graphics 
form  on  a  computer  generated  hardcopy  in  a  managerial  role  of  control  of 
resources. 

Based  on  the  results  of  57  of  63  tested  hypotheses,  the  attribute  of  color 
when  functionally  integrated  in  a  mixed  graphics  hardcopy  display  did  not 
significantly  affect  performance.  In  these  test  cases,  quality  of  decision  was 
not  improved  or  degraded  significantly  at  the  .05  significance  level,  nor  was 
decision  time. 

In  the  6  models  where  the  data  contained  evidence  that  the  color 
alternative  affected  performance,  the  CRT  alternative  ranked  best  in  5 


instances,  all  which  were  a  function  of  decision  quality.  The  average 
improvement  was  approximately  8.7  percent  over  the  monochrome.  Specifically, 
females  performed  best  when  interacing  with  the  chromatic  CRT  alternative,  but 
it  was  never  a  significant  improvement  over  the  monochrome  B/W  intensity 
pattern.  In  the  other  model  where  a  difference  in  performance  was  suggested, 
the  chromatic  RND  displays  ranked  first,  with  the  monochrome  B/W  ranking  second 
relative  to  decision  making  time.  The  improvement  was  significant  with  respect 
to  the  monochrome.  Monochrome  intensity  pattern  showed  a  11.5  percent 
improvement  in  performance  time  over  the  CRT  alternative. 

In  general,  these  results  are  concurrent  with  other  empirical  studies  on 
the  effects  of  color  when  integrated  into  visual  displays.  The  data  do  not 
contain  sufficient  evidence  to  conclude  the  chromatic  colors  have  a  unique 
quality  for  functionally  coding  information  in  DSS  over  a  monochrome  intensity 
pattern.  However,  if  color  is  functionally  added  to  a  hardcopy  mixed  graphics 
display,  the  solid  filled  intense  hue  of  the  preset  colors  available  on  a 
majority  of  the  current  CRTs  appear  to  be  the  best  overall.  The  manager  should 
not  assume  increased  efficiency  in  a  decision  maker's  performance  because  o..‘ 
the  functional  color  attribute. 
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